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A newly launched online services firm typically desires to build a large user base for its website and must 
market itself to potential users that are not aware of its services. Once some users have entered the online firm, 
network effects, which are a result of features such as social interactions, content sharing etc., typically help 
to sustain the users’ interest in the online firm. Marketing and network effects thus complement each other 
in the process of building the user base. This work is the first to study the interactions between marketing 
and network effects in determining the online firm’s optimal marketing policy. We build a model in which the 
online firm starts with an initial user base and controls the growth of the user base by choosing the intensity 
of advertisements and referrals to potential users. A large user base provides more profits to the online firm, 
but building a large user base through advertisements and referrals is costly; therefore, the optimal policy 
must balance the marginal benefits of adding users against the marginal costs of sending advertisements and 
referrals. Our work offers three main insights: (1) The optimal policy prescribes that a new online firm should 
offer many advertisements and referrals initially, but then it should decrease advertisements and referrals 
over time. (2) If the network effects decrease, then the change in the optimal policy depends heavily on two 
factors i) the level of patience of the online firm, where patient online firms are oriented towards long term 
profits and impatient online firms are oriented towards short term profits and, ii) the size of the user base. 
If the online firm is very patient and if the network effects decrease, then the optimal policy prescribes it 
to be more aggressive in posting advertisements and referrals at low user base levels and less aggressive in 
posting advertisements and referrals at high user base levels. On the other hand, if the online firm is very 
impatient and if the network effects decrease, then the optimal policy prescribes it to be less aggressive in 
posting advertisements and referrals at low user base levels and more aggressive in posting advertisements 
and referrals at high user base levels. (3) The change in the optimal policy when network effects decrease 
also depends heavily on the heterogeneity in the user base, as measured in terms of the revenue generated 
by each user. An online firm that generates most of its revenue from a core group of users should be more 
aggressive and protective of its user base than a firm that generates revenue uniformly from its users. 
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1. Introduction 

1.1. Motivation 

When an online services firm is launched it desires to build a large user base, but many potential 
users in the market may not be aware of the firm’s website and services. Therefore, the online firm 
needs to reach out and market itself to potential users by posting advertisements and by giving 
referrals. As users become aware of the online firm they will enter the firm’s website and then 
generate benefits for the firm by paying subscription fees, clicking on advertisements, etcQ. Users 
generate these benefits through the duration of their stay on the firm’s website and hence the 
online firm desires not only to have a large user base but also to have its users remain with it for 
sufficiently long. Network effects, which result from social features like chatting with friends, user 
feedback etc., help to sustain each user’s interest in the online firm and thus impact the length of 
time users continue to utilize its services. Network effects thus complement the force of marketing 
in the process of building a user base. However, no systematic models currently exist that can 
inform our understanding of how these two forces interact. For instance, the impact of the strength 
of network effects on the firm’s optimal policy is not well understood, for instance, how should 
an online firm change its marketing decisions under different magnitudes of network effects. In 
this paper, we develop a general model that considers both marketing and network effects, and we 
determine the optimal marketing policy a firm should adopt to maximize its revenue. 

1.2. Model overview 

We develop a dynamic continuous-time model for the user base growth of an online services firm. 
The online firm starts with a small initial user base and markets to potential users by posting 
advertisements and by giving referrals. The potential users can also become aware of the online 
firm through other methods, for e.g. word-of-mouth, which are not controlled by the online firm. 
The online firm desires a large user base in order to procure more revenue, but building its user 
base through marketing is costly; therefore, it must design a policy that balances the marginal 
benefits of adding users against the marginal costs of marketing. We assume that the users arrive 
in the decreasing order of preference, which is proportional to the benefit a user generates for the 
website. This leads to online firm’s revenue per unit time to be increasing and concave in the size 
of its user base, with the rate at which the benefit saturates reflecting the degree of heterogeneity 
in the user base. Users stay on the online firm for random durations, which due to network effects 
will increase in expectation as the user base size grows. The strength of the network effects is 
represented by the extent to which a growth in the user base influences the expected stay duration 
of each user. We assume that if the user base of an online firm increases then the number of friends 

^ See http://finance.yahoo.com/news/must-know-assessingfacebook- revenue-170009607.html 
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of each user on the online firm should increase as well. The average time that a user interacts with 
each friend affects the increase in the stay duration of the user for every friend that is added and 
thus quantifies the strength of network effects in our model. Therefore, the user base dynamics 
of the users on the online firm are determined by three distinct factors: the intensity of direct 
arrivals through word-of-mouth, the intensity of sponsored arrivals through advertisements, and 
the expected duration that each user stays on the online firm, which is impacted by the network 
effects. The online firm designs its policy taking future benefits into consideration, which depend 
on the user base dynamics of the online firm. The online firm discounts future benefits, with the 
extent of discounting depending on the duration the online firm expects to be in the market. 

1.3. Results 

We show that if the online firm starts with a small initial user base, then the optimal policy is 
unique and prescribes the online firm to give many advertisements and referrals initially and then 
decrease them over time. The optimal policy and the corresponding user base trajectory eventually 
converge to a unique steady state. Note that the optimal policy prescribes the online firm to 
decrease advertisements and referrals despite the fact that advertisements and referrals given at 
a later time can potentially be more beneficial; at later times, the user base is larger and each 
incoming user stays for a longer duration due to network effects. The factors that contribute to 
this result are: i) the marginal benefits decrease as the number of users increases, which is due to 
the concavity of the benefit function, ii) the stay time of the users is bounded above and saturates 
as the user base size increases. 

Given the volatility of online settings, the strengh of network effects can change rapidly as new 
competitors or avenues for social interactions emerge, and so it is important to understand how 
the optimal policy is affected by a decline in the network effects. We show that if there is a decline 
in network effects, then the prescriptions of the optimal policy depend heavily on two factors: i) 
the online firm’s patience level and ii) the size of the user base relative to an upper bound, which 
is an optimistic estimate of the online firm’s user base in steady state. If there is a decline in 
network effects and if the online firm is very patient, then the optimal policy will prescribe to be 
more aggressive in advertising and sending referrals when the user base is very small relative to 
the upper bound, while it will prescribe to be less aggressive in advertising and sending referrals 
when the user base is close to the upper bound. This result is reminiscent of the concept of barriers 
to entry in marketing. A very patient online firm operating at a small user base, which means 
that the market of the online firm is not saturated, increases the advertisements and referrals to 
create barriers to entry for the new online firms that cause a decline in its network effects. For 
instance, Netflix, which represents a very patient firm operating in a market (global market) that 
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is not saturated, increased investment to expand its user basen and counter the increasing number 
of other rival firms. On the other hand, if there is a decline in network effects and if the online 
firm is very impatient, then the optimal policy will prescribe to be less aggressive in advertising 
and sending referrals when the user base is very small relative to the upper bound, while it will 
prescribe to be more aggressive in advertising and sending referrals when the user base is close to 
the upper bound. 

The above comparison shows that the level of patience together with the size of the user base 
is crucial in determining the prescribed optimal policy. Interestingly, we also show that the level 
of heterogeneity in the user base measured in terms of the revenue generated by the users is 
also very important in determining optimal policy. To the best of our knowledge, this is the first 
time that the impact of heterogeneity in the revenue generated by the user base on the optimal 
marketing policy has been systematically studied. We show that if there is a decrease in the network 
effects, the optimal policy prescribes an online hrm with a very heterogeneous user base to increase 
its advertisements and referrals in the steady state. Examples of online hrms that have a very 
heterogeneous user base are online gaming hrms such as Zynga, where most of the users play for 
free and a very small fraction actually make in game purchases that contribute to revenue [j. Zynga 
witnessed a decline in network effects 0 due to the rise of competing website King0 and increased 
investments 0 in response to the decline in network effects, which is similar to the prescriptions 
of the optimal policy in our model. On the other hand, we show that if there is a decrease in 
network effects, then the optimal policy prescribes a hrm with a very homogeneous user base to 
decrease its advertisements and referrals in the steady state. Examples of online hrms that have 
a very homogeneous user base are subscription based online hrms such as Nethix, where all the 
users need to pay the subscription fee to use the service. 

In practice the online hrm cannot perfectly predict the user arrivals and exits since the users’ 
behavior is uncertain. We analyze how should the prescriptions of the optimal policy change to 
account for this uncertainty. We model the uncertainty in the user arrivals per unit time using 
a standard geometric Brownian motion process, which captures the uncertainty of user arrivals 
and exits. We show that if there is high uncertainty in user arrivals, then the optimal policy will 
prescribe to be less aggressive in advertising and sending referrals. We also show that if there is 


^ http;//www.forbes.com/sites/petercohan/2015/01/21/4-reasons-to-invest-in-netflix/ 

® http://venturebeat.com/2014/02/26/only-0- 15-of-mobile-gamers-account-for-50-percent-of-all-in-game-revenue-exclusive/ 
See alexa.com, https://en.wikipedia.org/wiki/Zynga 

® http;//www.forbes.com/sites/tomiogeron/2013/03/26/how-king-com-zoomed-up-the-social-gaming-charts/ 

® http://www.bloomberg.com/news/articles/2012-09-07/zynga-adds-more-developers-to-publishing-network-seeking- 


revenue 
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higher uncertainty in the user arrivals, then the optimal policy will achieve a smaller expected user 
base in steady state as well as achieve lower expected profits compared to the deterministic case 
with no randomness in arrivals/exits. Basically, this happens because the increase in the random 
arrivals cannot compensate for the increase in the random exits and the decrease in the sponsored 
arrivals. 

1.4. Key findings 

We summarize the key findings of this paper below. 

1. If the online firm starts with a small initial user base, then the optimal policy is unique and 
it prescribes the online firm to be more aggressive in advertising and send more referrals initially 
and then decrease the advertisements and referrals over time. 

2. If there is a decline in network effects, then the prescriptions of the optimal policy depend 
jointly on the level of patience and the size of the user base. The optimal policy prescribes a very 
patient (impatient) online firm to increase (decrease) the advertisements and referrals at small user 
base levels and decrease (increase) the advertisements and referrals at large user base levels. 

3. If there is a decline in the strength of the network effects, then we also show that the prescrip¬ 
tions of the optimal policy depend on the level of heterogeneity in the user base. The optimal policy 
prescribes an online firm with very heterogeneous (homogeneous) user base to increase (decrease) 
the advertisements and referrals in steady state. 

4. If the user arrivals/exits are uncertain, then we show that the optimal policy prescribes 
the online firm to be less aggressive in comparison to the case when the user arrivals/exits are 
deterministic. 

The rest of the paper is organized as follows. In Section 2 we describe the related literature. In 
Section 3 we present the model and the policy design problem of the online firm. In Section 4 we 
analyze the existence and uniqueness of the optimal policy. In Section 5 we analyze the behavior 
of the optimal policy on the path to steady state. In Section 6 we analyze the impact of change in 
network effects on the optimal policy. In Section 7 we discuss the extensions of the model and in 
Section 8 we conclude the paper. 


2. Related Literature 

2.1. Literature on advertising 

In this section we summarize the relevant works on advertising. The main difference between these 
works and our work is that these works do not incorporate network effects into their model and 
thus do not analyze the impact of n etwork effects on advertising. 


The seminal work of 


Ozgal (jl960l) was the first to study the lack of perfect knowledge about 


the market among buyers and sellers. It developed an analytical model of advertising based on 
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information diffnsion amongst the buyers. iNerlove and Arrow! (jl962l i considered a more general 
setting by incorporating the impact of advertising expenditnres on the demand of the prodnct 
into a firm’s decision making and proposed optimal advertising and pricing policies. The Bass 


model (iBass 


19691 1 was the first to analyze the process of the adoption of goods in a market at a 


micro level. It separated nsers in the market into two types, innovators and imitators and the two 
types followed different product adoption dynamics. The Bass model did not separately incorporate 
decision making such as the intensity of advertising and pricing in the market; however, the model 
has been treme ndously suc c essfn 


as discussed in 


Bass et al 


i n capturing the prodnct adopt ion dynamics in several scenarios 
(jl994l l. iRobinson and Lakhanil (|19751 1 separately incorporated pricing 


decisions of the firm into Bass’s model. They considered patient firms that derive pricing policies 
taking the evolution o f the market, i.e. demand, as predicted by Bass’s model into consideration. 


Thompson and Tend (jl984l l then considered a more general model with patient firms that derive 


joint pricing and advertising policies taking the evolution of the market into consideration. 

The above models are well snited for a market where the users generate one time benefits for 
the firm upon purchase of the product and leave. However, the rise of service-based economies 
has shifted the focns to scenarios where the relationship of the firm with the user is long-term in 
nature (for e.g., online firms, banks etc.) and this has been a snbject of investig ation recently (see 


for instance 


Gupta et al, 


20061. 


Gourio and Rudanko 


201411 . iGupta et al.l (j2006ll discusses various 


metrics based on different models that are used to evaluate the value of a user’s stay-time with the 
firm. The work compares these different metrics based on empirical insights. 

Theoretical res earch analyzing the dynamic s of a user base and its impact on the firm’s policies 


has been limited. 


Gourio and Rudankol (j2014^ analyzes the impact of search frictions in a market 


arising due to inefficiencies in acqniring nsers for the firm. They cite the enormons spending done 
by the firms for marketing and selling as evidence of such inefficiencies. They provide a micro¬ 
foundation for the search process and analyze the impact of search frictions on a firm’s profits, 
values, sales and markups. Our work makes several similar modeling assnmptions i) the user’s 
relationship with the online firm is long-term, ii) the online firm’s benefits depend on the stay 
dnration of the nsers, and hi) the process of advertising is inefficient, which canses the cost of 
advertising to be convex. A key difference between our model and all of the above works is that 
we consider the advertising policy of the online firm in conjnnction with network effects, i.e. larger 
networks positively impact the user’s stay duration on the firm’s website. Moreover, the questions 
that we investigate are different from the existing works and concern the impact of different network 
effect strengths on the optimal policy. Next, we discnss the relevant works which study the impact 
of different network effects on firm’s policies, especially with respect to pricing. 
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2.2. Literature on network effects 

In this section we summarize the relevant works that study network effects. The main difference 
between these works and our work is that these works do not incorporate advertising into their 
model and thus do not analyze the impac t of ne twork effects on advertising. 


The seminal work of 


Katz and Shanird (|19851 1 was the first to formally model and analyze the 


issue of pricing in various market scenarios where network effects play an important role. Since then, 


different scenarios (see for instance 

Candoffan et al 

. 2012 

, Lobel et al. 

2014 

). Most of the works 

er a static setting but 

) consider a dynamic 

fe.e.. Katz and Shaoiro 1985. Candoeran et al. 

2012. 

Lobe 

et al 

2014 

) consic 

some of the recent works (e.g.. 

Markovich and Moenius 

2009, 

Cabral 

2011 


setting where the users choose which network to enter while taking the pric e set by the network, the 
preference for the network and the size of the network into consideration. iFudenberg et al.l (119991 ) 
consider the dynamics of an incumbent and an entrant in a market with network externalities, and 
they show that the incumbent can set lower prices in the equilibrium due to the competition faced 
from the entrant. In each of these works, the authors do not model the advertising policy of the 
online firm, which will impact the rate at which users become aware of the network. To the best 
of our knowledge there is a gap between the literature on advertising and network effects. In this 
work we take an important first step to bridge this gap. 


3. Model 

In this section we propose a model to understand how aggressively an online services firm should 
post advertisements and send referrals in order to influence its user base dynamics. The two key 
features of the model are: i) the user base dynamics are impacted by network effects and ii) the users 
generate different revenues for the online firms, which causes the user base to be heterogeneous. 

3.1. User arrivals and exits 

We consider a setting with a continuum of potential users and an online firm which markets itself 
to these users by posting advertisements on other online firms (e.g., Facebook, Google etc.) or 
by giving referrals. New users visit the online firm either through sponsored media (advertise¬ 
ments/referrals) or arrive directly through exogenous methods, such as directly visiting the online 
firm or arriving at the online firm after hearing about it from others. We refer to these exogenous 
arrivals as direct arrivals and denote them as f{x), where x is the size of the user base of the 
online firm. Note that they are not directly influenced by the online firm. We assume that f{x) 
is increasing, non-negative, continuously differentiable and bounded above and below by positive 
constants and respectively. f{x) is increasing because when the user base is large more 
users become aware of the online firm and thus enter it. 
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Each user who visits the online firm will stay on it and then eventually exit the online firm after 
some random time, which is determined by a Poisson arrival process that starts along with the 
arrival of the user. At the first arrival of the process the user leaves. The exit can be interpreted, for 
instance, as the user losing interest in the online firm. The rate of Poisson arrivals is given by 
and hence, the stay time has an exponential distribution with a mean given by fi-\-r]g{x). The first 
term fj represents the expected stay time in the absence of the network effects; it is the average 
time a user will spend independent of the presence of others. The second term rjg{x) represents 
the expected stay time due to the network effects; it is the average time a user will spend on social 
interactions. The term ry represents the average time that a user spends per interaction and hence, 
it is a measure of the strength of network effects. Note that g is a fixed constant that depends on 
the type of the online firm (for e.g., Netflix features much less social interactions than Facebook) 
and the presence of other available venues for social interaction. We assume that ry is bounded, i.e. 
0 < ry < ry®“P. The term g{x) represents the average number of interactions that a user carries out. 
We assume that g{x) is non-negative, increasing, continuously differentiable and bounded above by 
g{x) is increasing because when the user base is large each user will have more opportunities 
to interact. We denote the size of the user base that is using online firm’s services at time t as 
x{t), where x : [0,oo) —)■ [0,oo) is the user base trajectory of the users. We restrict our attention 
to user base trajectories that are absolutely continuous. The initial user base x(0) is denoted by 
x{0) = x.niUai, which is assumed to be small, 0 < Xintuai < /™^?y. 

3.2. Online firm’s benefits and costs 

The size of the user base x is proportional to the unique user statistic, which is often used as a 
metric to evaluate an online firm’s popularity and hence, it is critical to the revenue generated by 
the online firm 0. we assume the. eaeh potential nset haa a «.ed ptefetence f„t the online fit. 
that is proportional to the benefit that the user will generate for the online firm. We assume that 
the users enter the online firm in a decreasing order of their preference. We can show that the 
size of the user base x on the online firm is sufficient to determine the preference distribution of 
the users on the online firm and thereby the benefit function (see Appendix J). Hence, the benefit 
generated by the online firm per unit time with a user base of size x is denoted as b{x), where 
6 ; R —)• [0, oo) is a continuously differentiable increasing function. In fact it can also be shown that 
the benefit function b{x) is concave (see Appendix J). The rate at which b{x) saturates reflects 
the heterogeneity in the user base in terms of the revenue generated by the users. Next, we give 
an example of a concave benefit function and show that the rate of saturation of b{x) reflects the 
extent of heterogeneity in the user base. 

http://www.pcmag.com/encyclopedia/term/53438/unique-visitors 
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Example- Consider an online firm and a user base of potential users, which has a total mass 
of 1. Each user has a preference for the online firm w drawn from a uniform distribution [0,1] 
independent of others. Preference uj represents the willingness to pay for services of the firm. A 
user with preference u will generate a benefit of for the online firm upon joining the online 

firm, where 0 < a < 1. If a is close to 1, then the user base is homogeneous, i.e. users generate 
similar benefits for the online hrm. For example, subscription based hrms such as Netflix have the 
same subscription fee for all the users. If a is close to 0, then the user base is heterogeneous, i.e. 
different users generate different benefits. For example, online gaming hrms such as Zynga have 
signihcant heterogeneity in benehts generated by users. Only 0.15 percent of the users generate 
50 percent of the revenue for Zynga (Seelfl). We assume that the users join the online hrm in the 
order of their preferences. We also assume that each user who joins the online hrm will not leave. 
Note that this assumption is made for a better exposition and can be relaxed. The total beneht 
generated by a user base of x users, where x < 1 is b{x) = * 

The long-term beneht of the online hrm considering a discount rate of p can be computed as 
B{x) = b{x{t))e~P*dt, where x is a succinct representation for the entire trajectory of x{t) for 
all t>0. Note that i represents the discount factor and is proportional to the expected duration 
of stay of the online hrm in the market. We refer to a online hrm with a high discount factor as 
patient and an online hrm with a low discount factor as impatient. 

The online hrm chooses the intensity at which it advertises itself and sends referrals and as a 
result it controls the rate of sponsored arrivals A(t), where A : [0,oo) —^ [0,A®“^] is an absolutely 
continuous and Lebesgue measurable function. The online hrm bears a cost per unit time for the 
advertisements, which increases with the intensity of sponsored user arrivals A(t). It is given by 
c(A(t)), where c : M —>■ [0,oo) is a continuously differentiable and increasing function. The online 
hrm needs to post a large numbe r of advertisements in orde r to hnd users interested in using its 
services. Hence, we assume (as in iGourio and Rudankol 120141 1 that the cost c(A) is strictly convex 


in A. The long-term discounted average cost to the online hrm is C'(A) = c{X{t))e~^*dt, where 

A is a succinct representation for the entire policy A(t) for all t > 0. The overall rate of change of 
the user base on the online hrm is 

1 


— =/(x)-|-A(f) — 7 _ 
dt {r] + r]g{x)) 


X 


The hrst two terms in the differential equation represent the rate of direct and sponsored arrivals 
respectively and the third term represents the rate at which the users leave the online hrm. Note 
that the third term is deterministic despite the fact that each user exits randomly. This is due to 


® http://venturebeat.com/2014/02/26/only-0-15-of-mobile-gamers-account-for-50-percent-of-all-in-game-revenue- 
exclusive/ 
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Variables 

Notation 

Control type 

Expected stay time at time t 

V + V9{x{t)) 

Indirect 

User base size at time t 

x{t) 

Indirect 

Direct arrivals at time t 


Indirect 

Expected stay time (network effects part) at time t 

r]g{x{t)) 

Indirect 

Average number of social interactions per user 

9{x{t)) 

Indirect 

Flow benefit at time t 

b{x{t)) 

Indirect 

Long-term discounted average benefit 

B{x) 

Indirect 

Sponsored arrivals at time t 

m 

Direct 

Flow costs at time t 

c{\{t)) 

Direct 

Long-term discounted average cost 

C{\) 

Direct 

Network effect strength 

9 

No control 

Expected stay time (independent part) 

V 

No control 

Initial user base 

^initial 

No control 

Discount rate 

p 

No control 

Upper bound on direct arrivals 

jsup 

No control 

Lower bound on direct arrivals 


No control 

Upper bound on strength of network effects 

jjSUp 

No control 

Upper bound on social interactions per user 

gSUp 

No control 


Table 1 Classifying the variables in the model based on the control type 


the fact that there is a continuum of users and each user will leave the online firm with probability 
fj+r]g(x) ™ time interval At, with At small, independent of other users. Hence, the number 
of users who leave the online firm in At time is -, ^ , . xAt. Table 1 summarizes the notation used 
in this paper and classifies the variables based on the type of control that the online firm has on 
each of them (direct, indirect or no control). 

While in this paper we assume that the users arrive in the decreasing order of their preference, 
which results in a concave benefit function, we also extend the model and the results to incorporate 
a different order of arrivals of the users, in which the benefit functions are not necessarily concave 
(see Appendix G). We can also relax the assumption that the users are characterized by a decision 
function that leads to the direct arrivals at the rate f{x) and sponsored arrivals at the rate A(f). 
We extend the model and results to scenarios where the users take decisions to enter the online 
firm or not based on a utility function that depends on the preference of the user and the size of 
the user base (see Appendix H). 

In the next section we describe the policy design problem of the online firm. 

3.3. Online firm’s policy design problem 

The online firm desires to maximize its long-term discounted profit, i.e. B{x) — C(X). It solves for 
the optimal policy that maximizes its long-term discounted profit, which is formally stated below. 
Recall that the initial user base x(0) = xinitial is given 


Policy Design Problem 
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max 

A 


roo 

B{x) — C{X)= / {b{x{t)) — c{X{t)))e~^^dt 

Jo 


dx 1 

subject to — = f{x) + X{t) - — -^ 

dt {r] + r]g{x)) 


X 


We denote a policy that solves the above continuous-time optimization problem where 

A*.: [0,oo) —)• [0,A®“P] and we refer to it as the optimal policy as a function of time. The 


^initial 


: [0, oo) —>■ [0, oo] is 


corresponding optimal user base trajectory is denoted as x*. where x*. 
a function which maps each time instance t to the user base at that time. 

The maximum value of the long-term profit that is achieved by solving the above continuous-time 
optimization problem is denoted as Il-*{xinitial, d)- has to satisfy the Hamilton-Jacobi- 


Bellman (HJB 

to Xinitial (sCe 


equation, whi ch is given below, provided Ii*{xinitial,’rj) is differentiable with respect 


Kushner 


19681 1. Next, using the HJB equation we will compute an optimal policy 


that will depend only on the size of the user base and relate it with the optimal policy as a fun ction 
of time This approach is standard in optimal control ISee lSoever and Chund (j2008ll j. 


pn(x, r/) = max {h{x)) - c(A) + + A - ^ , , x)) 

o<A<A'>“p Ox ri + rjg[x) 

We substitute the optimal solution of the policy design problem Il*{xinitial, f]) m the HJB equa¬ 
tion. The maximizer of the right hand side in the HJB equation is given as C*{xinitiai) = 
argmaxo<A<A«“p - c(A) initial) + A - where 

C*{xinitiai) correspoiids to the optimal po licy at the initi al user base size Xinuiai- This follows from 
the Bellman’s principle of optimality fsee lxushner 1968 1. Note that this optimal policy C*{xinitiai) 
only depends on the initial size of the user base. Following the same method we can obtain an 
optimal policy at different user base sizes x using the optimal solution n*(x,ry). We can thus define 
an optimal policy as a function of the user base size, denoted as (*, where (* : [0,oo) —[0 ,A®“p] 
is a mapping from the user base size x to the optimal policy. If the optimal policy as a function 
of the user base size C* aiid the optimal policy as a function of time are unique, then 

At (0 = C*(3Jt iO) has to hold for consistency. 

^initial ^ ^ ^ ^initial ^ '' 

In the next section we will discuss the existence and uniqueness of the optimal policy as a 
function of the user base and the optimal policy as a function of time. 

4. Existence and uniqueness of the optimal policy 

In this section we hrst show that the optimal policy as a function of user base C* exists and is 


unique. Next, we show that the optimal policy as a function of time A 


^initial 


also exists and is 


unique. 
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First, we state some assumptions that are required in the rest of the paper. Note that it can 
be shown that every user base trajectory satisfying the differential equation in the policy design 
problem is always bounded above by Xu (see Appendix A for the expression) and bounded below 
by Xinitiai (see Appendix A for the proof) Therefore, in the next few assumptions we only require 
the properties stated to hold in an interval [xmiuai, Xy] ■ 


Assumption 1. 1. c'(0) < < c'(A®“p), Vx g [ximUahXu], where c'(A) 

P^xtial derivative ofIl*{x,r]) with respect to x. 


dX 


2 . 


d ^ Ii * (31,77) 


exists and is continuous with respect to x, Vx G [xinmai , x^ 


We can show that exists (see Appendix A). The first part of Assumption 1 requires 

bounded above and below, which should hold for online firms whose profits do not 
change too sharply with a change in the size of the user base x. This assumption ensures that the 
optimal policy is always in the interior of [0, A®“^]. The second part of the assumption requires that 
^ exists and is continuous, which should hold for online firms whose marginal increase in 

profits as users are added does not change too sharply with an increase in the size of the user base 
X. We discuss in Appendix A cases when Assumption 1 does not hold. In these cases the optimal 
policy will not necessarily be in the interior of the set [0, A®“^], despite that we can show that the 
flavor of the main results in this paper will not change. 

We define the rate of arrivals with no advertisements and referrals as follows h,,(x) = /(x) — 

_ 1 _ X 

Assumption 2. h,,(x) is continuously differentiable and concave in the interval [ximUaiyXy]- 


In Assumption 2 we assume that the rate of arrivals in the absence of advertisements and referrals 
hn{x) is concave. This implies that the marginal increase in the arrivals with an increase in the 
user base decreases at larger user base levels. This assumption holds for online hrms for which the 
popularity increases faster at relatively small user base levels and gradually saturates as the online 
firm becomes larger. 

Having stated the assumptions we now move to the main results of this section. 


Proposition 1. The optimal policy as a function of user base C* exists and is unique. 

For all the detailed proofs refer to Appendix given at the end. The proof of the above proposition 
is given in Appendix A. 

Proposition 2. The optimal policy as a function of time A*. ^ and the corresponding user 

base trajectory as a function of time exists and is unique. 


® The population trajectory not falling below XinUiai is a result of the deterministic model in Section 3. In Section 7 
we work with a stochastic model, where the population trajectory can fall below the initial population level. 
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The proof of the above proposition is given in Appendix A. 

Note that Proposition 2 does not follow directly from Proposition 1. To show Proposition 2 
we need to prove that once C* is uniquely determined there exists a unique solution to ^ = 
f{x) + \{t) — X for a given initial condition x(0) = Xinuiai, which will be denoted as 

Now we know that both the policy as a function of the user base ('* and the policy as a function 
of time exist and are unique. The uniqueness of the policy is important in order to compare 

the change in the optimal policy under varying network effects. Note that as we already pointed 
out the following relation will hold when the benefit and cost 

function are quadratic and f{x) + X{t) — x is linear in x, then a closed form expression can 

be obtained for the optimal policy as a function of time and optimal policy as a function of user 
base size (See Appendix I). 

In the next section we discuss the convergence of the optimal policy and also investigate the 
questions concerning how the optimal policy changes as a function of time. 

5. Optimal policy behavior and convergence to steady state 

In this section we analyze the convergence of the optimal policy as a function of time A*. ^ and 

the corresponding optimal user base trajectory . In order for us to analyze convergence we 

define the steady state of the policy as follows. 

Definition 1. The steady state of a system is a pair (As,Xs), which satisfies the following 
condition: if the initial user base in the policy design problem is Xs, then X{t) = Ag, Vt is the optimal 
policy as a function of time and the corresponding user base trajectory is x{t) = Xg Vt. 

The above definition of the steady state states that if the initial user base is at the steady state 
value then the optimal policy will correspond to the exact number of sponsored arrivals such that 
the user base stays at that fixed value. The next theorem proves that the steady state exists and 
is unique. 

Theorem 1. The steady state (Ag,Xs) of the online firm exists and is unique. 

For the proof of the above theorem refer to Appendix B given at the end. 

Theorem 1 shows that the steady state exists and is unique. If we can show that the optimal 
policy converges to the steady state, then understanding the impact of changing network effects on 
the steady state, which is unique, will serve as an important tool for analysis in the next sections. 
Next we prove that the optimal policy converges to the steady state. 

Theorem 2. The optimal policy as a function of time corresponding opti¬ 

mal user base trajectory as a function of time xl.^.^.^ft) converge to the steady state values, i.e. 
limt^oo (t) ^ Ag and lim*^^ (t) ^ Xg. 
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For the proof of the above theorem refer to Appendix C given at the end. 

The intuition behind the proof of the above theorem is straightforward. We know that the initial 
user base is smaller than the smallest steady state user base level Based on this it can be 

argued that the optimal user base trajectory is always increasing. We also know that all the user 
base trajectories are bounded above hence, the optimal user base trajectory will converge. Next, we 
analyze the behavior of the optimal policy on its path to the steady state: When the online firm is 
launched, should it give more referrals in the beginning? Or should it give more referrals at a later 
stage?. The online firm faces the following trade-off in choosing among these two approaches. The 
first approach requires the online firm to market aggressively in the beginning and achieve a large 
user base quickly. A large user base will generate more benefits for longer durations but requires 
the online firm to pay more costs for marketing. The second approach requires the online firm to 
wait for the user base to increase sufficiently and then market aggressively. The second approach 
allows the online firm to limit costs and invest later when the returns from each advertisement and 
referral will be high since the users will stay longer due to the network effects. The next theorem 
determines which approach is better. 

Theorem 3. The optimal policy as a function of time A*. ft) decreases with time t and the 
corresponding user base trajectory as a function of time increases with time t. 

For the proof of the above theorem refer to Appendix D. 

We show that the marginal increase in profit from adding users at a larger user base is lower 
and thus conclude that the first approach will be superior to the second. In the second approach, 
the online firm delays the choice of giving more referrals to achieve larger gains at a later stage. 
Due to the decrease in the marginal increase in profit from adding users, the online firm cannot 
compensate for the loss it has to bear because of a smaller user base in the earlier stages. 

In the next section we discuss the impact of the network effects on the optimal policy. 

6. Impact of a change in network effects on the optimal policy 

How does a decrease in the strength of network effects impact the prescriptions of the optimal 
policy to the online firm? A decrease in the strength of network effects can result from an increase 
in other avenues for social interactions or the emergence of other online services which leads to an 
increase in indirect competition. If there is a decrease in network effects, then the following two 
comparisons are relevant: 

1. Comparing the prescriptions of the optimal policy C*(®) same user base size x. 

2. Comparing the prescriptions of the optimal policy (^) at the same time t. 

A comparison at the same user base size is significant because the same user base size corresponds 
to same level of popularity for the online firm, while a comparison at the same time is significant 
because the same time corresponds to the same date after the launch of the online firm. 
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6.1. Impact of the network effects: comparing the optimal policy at the same size 
of the user base 

In this section we analyze the impact of a decrease in the strength of network effects on the optimal 
policy C*(3:) at the same size of the user base. Next, we state some assumptions needed exclusively 
for the results in this subsection (Theorem 4 and Corollary 1). 

We assume that c (A) has a positive lower bound . c (A) measures the additional cost that the 
online firm needs to bear for increasing the sponsored arrival rate by a unit value. The online firm 
will post more advertisements to increase the sponsored arrivals. Typically, there is a minimum 
cost for posting an advertisement, which justifies this assumption. We also assume that Yi*{x,rf) 
is continuously differentiable with respect to rj in the interior of the interval [0,7y'*“^]. In addition, 
we also assume that the rate of arrivals at Xiniuai in the absence of referrals and advertisements 
is sufficiently high h^{xinitiai) > , Vry e [0,77®“^], where u(A) = c(c'”^(A)) and c'“^(A) is the 

inverse of the function c (A). 

For the next theorem we define a lower bound p* and an upper bound p“ on the discount rate, 
where p* < p“. If /> < p* then the online firm is very patient and if p > p“ then the online firm is very 
impatient. We also define a lower bound Xi and an upper bound X 2 on the user base size, where 
^initial ^ Xi < X 2 < x^. The expressious for these thresholds are derived and stated in Appendix E. 

Theorem 4. If there is a decrease in the strength of network effects rj and 

• if the online firm is very patient (p < pf and has a sufficiently small user base x <Xi, then 
the optimal policy prescribes to increase the advertisements and referrals; 

• if the online firm is very patient (p < pf and has a sufficiently large user base x > X 2 , then 
the optimal policy prescribes to decrease the advertisements and referrals; 

• if the online firm is very impatient (p> p“J and has a sufficiently small user base x <Xi, then 
the optimal policy prescribes to decrease the advertisements and referrals; 

• if the online firm is very impatient (p> p^) and has a sufficiently large user base x>X 2 , then 
the optimal policy prescribes to increase the advertisements and referrals. 

For the proof of the above theorem refer to Appendix E. 

Before we discuss the above theorem, we point out that the result of the above theorem does 
not depend on the assumption that the benefit function is concave and this is discussed in detail 
in Appendix G. 

In the above theorem we show that the change in the optimal policy in response to a change in 
the network effects depends on the size of the user base and the level of patience. We first discuss 
the intuition for the result on very patient online firms. Examples of very patient online firms 
include subscription based online firms as these online firms provide services and expect the users 
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Firm type based on p 

Size of the user base 

Change in referrals and ads 

Very patient 

Small 

Increase 

Very patient 

Large 

Decrease 

Very impatient 

Small 

Decrease 

Very impatient 

Large 

Increase 


Table 2 Impact of the decrease in the network effects on the referrals and advertisements 


to continually use them. If there is a decrease in network effects, then it can potentially lead to 
a reduction in the user base size in the future, causing an increase in the marginal benefits from 
adding users. If the user base is sufficiently small then the increase in the marginal benefits is large 
and since the online firm is very patient the optimal policy prescribes giving out more referrals 
and advertisements such that it can achieve a large user base in the long run that compensates for 
the costs of giving more referrals and advertisements. On the other hand, if there is a decrease in 
network effects and if the online firm has a sufficiently large user base, which is close to its steady 
state level, then it does not expect to achieve a much larger user base than its existing user base in 
the long run. Therefore, the optimal policy prescribes decreasing the advertisements and referrals 
and saving the costs instead. Next, we discuss the intuition for the result with very impatient online 
firms. 

Examples of very impatient online firms are online gaming firms because games typically have a 
short life-time. If there is a decrease in network effects and if the online firm has a small user base, 
then potentially there will be a reduction in the user base size in the future. The online firm is 
impatient and will not increase the advertisements and referrals because increasing advertisements 
and referrals will lead to an increase in the user base only in the long-term, while the online firm 
values short-term benefits. Also, an increase in the advertisements and referrals will cause a large 
increase in the short-term costs. This is because when the user base is small the sponsored arrival 
rate is already high and increasing it further will be very costly. On the other hand, the opposite 
would be true if the online firm has a sufficiently large user base. At large user base the rate of 
sponsored arrivals is low hence, the marginal costs of advertising are low as well. Thus, the online 
firm increases advertisements and referrals to maintain the large user base that generates sufficient 
benefits in the short-run and compensates the costs of the increase in referrals and advertisements. 

The results of the above theorem are summarized in Table II. 

In this section we learned how online firms should change their optimal policy in reaction to a 
reduction in the network effects. We saw that the the prescriptions of the optimal policy crucially 
depend on the online firm’s level of patience and the size of the user base. Next, we state a corollary 
of Theorem 4 that compares the optimal policy A* at time t = 0 under the impact of decreasing 

^initial 

network effects. For this corollary we assume that Xinuiai < Xi in addition to the assumption in 
Section 3.1 that XinUiai < 
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Corollary 1. If there is a deerease in the strength of network effects rj and 

• if the online firm is very patient (p < pf, then the optimal policy prescribes to increase the 
advertisements and referrals at initial time instances close to t = 0; 

• if the online firm is very impatient (p> p^), then the optimal policy prescribes to decrease the 
advertisements and referrals at initial time instances close to t = 0. 

At time t = 0 the population trajectory will have the same value Xinniai and therefore, we can 
use the comparison at low population levels in Theorem 4 to derive the above Corollary. In the 
next section we compare the optimal policy at large times t —)• oo under the impact of decreasing 
network effects. 

6.2. Impact of the network effects: comparing the optimal policy at the same time 

In the previous section we already compared the change in the prescriptions of the optimal policy 
at time t = 0 (see Corollary 1). Next, we will analyze the impact of a change in network effects 
on the optimal policy at time t —)• oo, i.e. in the steady state. The results that we will discuss 
extend to sufficiently large values of time instances (due to the continuity properties of the optimal 
policy). Recall that in Section 2 we proved that the optimal policy will converge to the steady 
state denoted as (Ag,Xg). In essence, we will analyze how Ag changes when the strength of network 
effects decreases. 

Theorem 5. If there is a decrease in the strength of network effects rj and 

• if the online firm is very patient (0 < p < pf, then the optimal policy prescribes to decrease 
the advertisements and referrals in the steady state; 

• if the online firm is very impatient (p> p^), then the optimal policy prescribes to increase the 
advertisements and referrals in the steady state. 

For the proof of the above theorem refer to Appendix F. 

If the online firm is very patient, then a decrease in the network effects causes the online firm 
to increase the advertisements and referrals in the initial stages close to t = 0 (see Corollary 1) 
such that it can build a large user base and derive more benefits in the long run. As time goes 
on and the user base becomes sufficiently large the online firm decreases the advertisements and 
referrals and saves the costs as it does not expect the user base size to increase much. On the other 
hand if the online firm is very impatient, then a decrease in network effects causes the online firm 
to decrease the advertisements and referrals in initial stages close to t = 0 as the marginal costs 
for increasing the advertisements and referrals are high (at small time instances there are many 
sponsored arrivals which causes the marginal costs to be high). When the online firm is close to the 
steady state it does not want to lose the user base acquired up until that point, and it can prevent 
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Firm type (based on p) 

Change in referrals and ads 

Time 

Very patient 

Decrease 

Steady state 

Very impatient 

Increase 

Steady state 

Very patient 

Increase 

Start t = 0 

Very impatient 

Decrease 

Start t = 0 


Table 3 Impact of the decrease in the network effects on the referrals and advertisements in steady state 


the users from exiting as quickly by increasing the advertisements and referrals as the marginal 
costs for increasing advertisements are low (at large time instances there are few sponsored arrivals 
which causes the marginal costs to be low). We summarize the above comparisons in Table III. 

Until now we have discussed the impact of the online firm’s patience level on the change in 
the advertisements and referrals in the steady state. Next, we show that the type of the online 
firm as determined by the heterogeneity in its user base also plays a key role in its decisions. In 
the comparisons that follow we will focus on the impact of this heterogeneity on the online firm’s 
decisions to change the optimal policy at large values of time instances (in steady state) when the 
network effects decrease, while keeping other parameters of the problem fixed. 

We partition online firms into two categories based on the rate at which the marginal benefits 
increase when the user base decreases. Note that b (x)x is proportional to the change in the 


marginal benefits and b (x) is proportional to the marginal benefits. Therefore, ^ / is proportional 

b (x) 

the rate of change of marginal benefit. We also define constants 0, A such that 0 < A, whose 
expressions can be found in Appendix F. 


• If ^ online firm is of the heterogeneous type for which the 

marginal benefits from adding users increase quickly when the user base decreases. These online 

firms have a very heterogeneous user base, for e.g., online gaming firms. 

// 


• If > A, then the online firm is of the homogeneous type for which the 

marginal benefits from adding users increase slowly when the user base decreases. These online 
firms have a very homogeneous user base, for e.g., subscription based online firms. 


Theorem 6. If there is a decrease in the strength of the network effects rj and 

// 

• if the online firm of the heterogeneous type, i.e. sup,j,g 

policy prescribes to increase advertisements and referrals in the steady state; 

n 

• if the online firm of the homogenoeus type, i.e. inf^^g optimal 

b [x) 

policy prescribes to decrease advertisements and referrals in the steady state. 

For the proof of above theorem refer to Appendix F. 

The main conclusion from Theorem 7 is that the type of the online firm based on its benefit 
function determines its response to the change in network effects. The intuition for the above 
theorem is the following. Suppose an online firm of the heterogeneous type sees a decrease in 
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Firm type (speed of saturation of marginal benehts) 

Change in 
referrals and ads 

Time 

// 

Fast < 0 (online gaming hrms) 

Increase 

Steady state 

Slow mixe[xi^m^i,xu] ^ (subscription based online hrms) 

0 [x) 

Decrease 

Steady state 


Table 4 Impact of the decrease in the network effects on the referrals and advertisements in steady state 


network effects, which can lead to reduction in the user base. A reduction in the user base can 
cause a large increase in the marginal benefits from adding users, because some of the users who 
generate large benefits for the online hrm will have exited. Hence, by giving more advertisements 
and referrals the online hrm can avoid a sharp reduction in the user base and thus sustain the 
users who generate large benehts for a longer duration. On the other hand, if the online hrm is of 
the homogeneous type then the increase in the marginal benehts from reduction in the user base 
is not sufficiently high. Hence, the online hrm will save on costs by reducing advertisements and 
referrals. 

We summarize this comparison in Table IV. 

In the next section we discuss an extension of the model where we incorporate uncertainty in 
user arrivals/exits in the online hrm’s policy design problem. 


7. Extensions 

7.1. Impact of uncertainty in the user arrivals/exits on the optimal policy 

7.1.1. Model: In this section we model the uncertainty in the user arrivals and exits and 
reformulate the online hrm’s policy design problem, which is now aimed at maximizing the expected 
long-term prohts. A main source of uncertainty is the unpredictability in the preferences of the 
users for the online hrm. Given the analytical difficulties associated with dealing with stochasticity 
in user dynamics, we will consider a specihc case of the model proposed in the previous sections. 

We assume that the direct arrivals as a function of the user base X = x (user base size A is a 
random variable whose dynamics are described later) are constant, i.e. f{x) = 9 and the growth in 
the stay time due to increase in the user base user base is constant, i.e. g{x) = 1 o. The random 
arrivals/exits in a time interval dt is given by aXdWf, where a is the standard deviation of a 
standard Brownian motion and dWt is the differential of the Brownian motion process. Note that 
if aXdWt is negative then it indicates that more users are leaving, while if aXdWt is positive then 
it indicates that more users are joining. The online hrm chooses the intensiW of advertisements 
and referrals and as a result it controls the rate of sponsored arrivals A{t,X) at time t and user 


g{x) is constant for online firms that do not rely significantly on the social interaction amongst users, and hence 
the stay time does not increase with the growth of the user base. 

In the deterministic case discussed we specified sponsored arrivals only as a function of time, i.e. A(t), as it was a 
sufficient succinct representation 
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base level X. The user arrival process on the online firm follows the stochastic user base dynamic 
given as dX = {9 + A{t,X) — jTr^X)dt + aXdWf 

We analyze the benefits and costs of the online firm. We assume that the online firm has a finite 
capacity T, which is true in practice because the servers of the online firm cannot handle unlimited 
traffic. The benefit function is b : [0,oo) [0,oo) with b{x) = T^ — (T — x)^. b{x) increases as the 

user base size increases from x = 0 up to the capacity x = T beyond which it decreases and b{x) is a 
concave function. In the previous sections we had assumed that b{x) is an increasing function. Here 
we will assume that the capacity is sufficiently large, i.e. T > Or], which will ensure that along the 
optimal user base trajectory there is typically an increase in the benefits. The long-term average 
benefit is given as B{X\A) = b(x(t))e~^^dt, where X = {x(t)}“o represents a sample path of the 

user base trajectory for a given policy A = {A(t, x(t))}^g. The expectation of the long-term benefit 
is given as Ex[B{X\K)\, where the expectation is taken over the sample paths X. We assume that 
the flow cost to the online firm is quadratic and is given as c(A) = c.A^ o. The long-term average 
cost is given as C'(A) = c{A{t,x{t)))e~f*dt and the expected cost is given as £'x[C'(A)], where 
the expectation is taken over the sample paths X. The long-term expected profit of the online firm 
is given as Ex[B{X\A) — C'(A)]. 

We state the online firm’s stochastic policy design problenj^ that takes the initial user base 
X(0) = x.muai as given as follows: 


BstochixinitiahV) = Ex[B{X\A) -C{A)] 

A(t,X)eR 

subject to dX = {6 + A{t,X) — 


1 


-X)dt + aXdWt 


We first make some remarks about our approach to solve the above problem. listochixtmUahV) 
has to satisfy the HJB equation for the above problem. We formulate the HJB equation for the 
above problem and arrive at a solution to the HJB equation. Note that th e above problem is 
an inf inite horizon Linear Quadratic Gaussian (LQG) control problem. From ISpever and Chung 
(120081) we can conclude that the negative definite solution to the HJB equation will correspond to 
B-stochix^n^uahV)- The optimal policy can be derived using Ustockix,„iuai,v) as it is the maximizer of 
the term in the RHS of the HJB equation. Let us denote the policy that we construct as A* {t,X). 
This policy A*(t,X) will lead to a user base trajectory, which is random due to the stochastic 
nature of the dynamic. A sample path of the optimal user base trajectory is denoted as 
and the corresponding sample path for the optimal policy is given as A*,^,^,^^(t), which satisfies 


In iGourio and Rudankol ll2014h the authors assumed quadratic costs for searching for the customers and their 
calibration of the model showed that quadratic costs are a good fit for the data 

In the policy design problem we do not have a constraint on the policy A{t,X) being non-negative. However, if F 
is very large then A{t) will have a low probability of being less than zero. 
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= A*{t, . In the previous sections we analyzed the deterministic counterpart of 

given as this section we will analyze the expectation of sponsored arrivals 

A* (t), where the expectation is taken over the optimal sample paths (t) and is denoted 

as Ex*, [A* , , (01 = Ex*, [A*{t^XZ, , , (t))]- Also, define the expectation of the optimal 
user base trajectory as Ex*, [X* , , (f)] 

We will first briefly discuss the results along the lines of the previous sections. For these results 
we will assume that the standard deviation of the Brownian motion is bounded above by p. The 
details of the discussion that follows have been worked out in Appendix I. We can prove that the 
expectation of the optimal policy i?[A* (t)] decreases and converges to a steady state value as 

f —>■ oo (extension of Theorem 2 and 3), which shows that even though the optimal policy A* (t) 
can take random values it will have a stationary (time-independent) mean as time goes to infinity. 
In Section 6 we discussed the impact of network effects on the optimal policy at the same size 
of the user base C*{x) and the optimal policy at the same time If network effects 

decrease, then the change in the optimal policy depends on the capacity of the online firm F. See 
Appendix I for the details. 

Next, we discuss the impact of increase in the uncertainty in the user base a. We first compare 
the optimal policy A*{t,X) when the network effects decrease at the same size of the user base. 

7.1.2. Impact of increase in uncertainty: comparing the optimal policy at the same 
size of the user base: In the next theorem we show how A*{t,X) changes when the level of 
uncertainty a increases. The optimal policy A*{t,X) does not depend on time t and is in fact an 
afhne decreasing function in X (see Appendix I for details). Hence, for a fixed X the A*{t,X) is 
deterministic. 

Theorem 7. If the level of the uncertainty a increases, then the optimal policy prescribes to 
decrease the referrals and advertisements for the same size of the user base X. 

For the proof of the above theorem refer to Appendix I. 

The intuition behind the above theorem is as follows. When a increases then there is a increase 
in the number of random arrivals/exits. There will be an increase in benefits due to the large 
number of random arrivals but there will also be an increase in the losses due to the large number 
of random exits. Hence, the optimal policy needs to weigh these potential benefits against the 
potential losses and then prescribe accordingly. If there is a large number of exits, then the size of 
the user base decreases by a large value and it will take a large amount of time for the user base 
to increase in size. Hence, if the online firm increases the advertisements and referrals, then it will 
have to bear large costs when the user base size decreases by a large value. Since the costs are large 
and the increase in the benefits due to the increase in the random user arrivals will not be sufhcient 
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to compensate for it (due to concavity of the benefit function), the online hrm is prescribed to 
decrease advertisements and referrals. 

7.1.3. Impact of increase in uncertainty: comparing the user base size and profits 
achieved In this section we aim to compare the expected size of the user base and the prohts that 
are achieved as t —)• oo, i.e. in steady state, under varying levels of uncertainty. The uncertainty in 
the user dynamics has both a positive component in the form of random arrivals and a negative 
component in the form of random exits. In the next theorem we show that under higher levels of 
uncertainty in user arrivals/exits following the optimal policy leads to a smaller expected size of 
the user base in steady state as well as the expected profits achieved by the online hrm. 

Theorem 8. If the level of uncertainty a increases, then the prescriptions of the optimal policy 
lead to a smaller expected size of the user base and lesser expected profits that are achieved. 

For the proof of above theorem refer to Appendix I. Due to an increase in the level of uncertainty 
there is an increase in both random arrivals and exits. We have already seen in Theorem 7 that 
the optimal policy prescribes the online hrm to decrease advertisements and referrals. The increase 
in random arrivals cannot compensate for the increase in random exits and decrease in sponsored 
arrivals. Hence, the expected size of the user base that is achieved in steady state is smaller and 
the same reasoning applies for the expected prohts as well. 

8. Conclusion 

In this work we developed a hrst general model to understand the complementary forces of network 
effects and marketing. Our key hndings show how an online hrm should change its optimal policy 
when there is a decline in network effects and we identify the main forces that determine the change 
in the optimal policy. If there is a decline in the network effects, then an online hrm that is very 
patient is prescribed to increase the advertisements and referrals when the user base is small and 
decrease the advertisements when the user base is large. On the other hand, if an online hrm is 
very impatient, then it should decrease the advertisements and referrals when the user base is small 
and increase the advertisements and referrals when the user base is large. 

We also identify another interesting dimension that differentiates the response of the online hrms 
when there is a decline in network effects. If there is a decline in the network effects and if there is 
signihcant heterogeneity in the revenue generated by the users for the online hrm, then the optimal 
policy prescribes to increase the advertisements and referrals. On the other hand, if there is a 
decline in the network effects and if there is homogeneity in the revenue generated by the users for 
the online hrm, then the optimal policy prescribes to decrease the advertisements and referrals. 
We also analyze the impact of uncertainty in the user base on the online hrm’s optimal policy, user 
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base size achieved and the profits achieved. If the level of uncertainty is higher, then the optimal 
policy prescribes the online firm to be less aggressive, and thus we can show that the expected 
steady state user base and the expected profits that are achieved are lower. 

In this work we made the hrst step towards analyzing the joint forces of marketing and network 
effects. There are several interesting future research topics that can be analyzed. First, we can 
consider the setting where multiple firms compete to build a user base. This will help understand 
the interaction of network effects, marketing and competition. We considered the setting where the 
online firm does not influence the type distribution of the users who enter e.g., through targeted 
advertisements. It will be interesting to extend the model to incorporate this dimension as well. 


Appendix A: Proof of Existence and Uniqueness of the Optimal Policy as a 
function of user base and as a function of time 

In Proposition 1 and 2 we state that the optimal policy as a function of user base and the optimal policy as 
a function of time exist and is unique. In this Appendix we develop the proofs for both Proposition 1 and 2. 
Next, we outline the steps that will lead to the proof of both the propositions. 


and the correspond- 


We will first begin by showing that the optimal policy as a function of time A*. 
ing user base trajectory exist. In order to do so we will use the sufficient conditions proved in 

Dmitruk and Kuz’kinal (|2005r) . Once we show that the optimal policy as a function of time and the cor¬ 


responding user base trajectory exist we will show that the value function starting from Xinmai given as 
{xinitial tT]) is finite. We can extend this and show that n*(x,77) is finite for all values in a finite interval 
[Xinitiai,Xu]- Xu = max{(/““P-|-A'*“P)(77-|-?7'*“Pg““^),a;^}, where Xm is a constant defined in Appendix E. The 
reason why we define Xu in this way will become clear later. Next, we will show that the value function 
n*(a;,r 7 ) is differentiable in the interval x„]. If the value function is differentiable then it will imply 

that it has to satisfy the HJB equation iKushnen (|l968r) . 

pIV{x, r]) = maxo<A<A-P {b{x)) - c(A) -f ^^^^{f{x) + X- :-^:^x)) 

The maximizer of the RHS in the HJB equation corresponds to the optimal policy as a function of user 
base C* ■ We will show that the maximizer of the HJB equation exists and is unique. This will complete 
the proof of Proposition 1. Next, we will need to show that the optimal policy and the corresponding user 
base trajectory as a funct i on of time is unique. For this part, we will use results from the theory of ODEs 


Coddington and LevinsonI (119551) . which will help us prove the uniqueness of the solution to the differential 


exists. 


equations. 

We begin by showing that the (A* 

We now show that the assumptions stated in iDmitruk and Kuz’kinal (|2005r) are satisfied by our model. 

1. First, the policies A(t) are required to be Lebesgue measurable and the user base trajectories x{t) 
is required to be absolutely continuous on every interval [0,r]. Both these requirements are true due to 
the modeling assumptions in Section 3. Denote the rate at which the user base changes as ^ = k^{x,X) = 
f{x) + X— We can also write A) = hr,{x) -I- A. 
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2. We know that k^{x,\) is continuous in (a;,A) because hr,{x) is continuous and A is added to it, which 
is linear. 

3. Next, we will show that any user base trajectory that starts at Xinuiai and satisfies the differential 

equation ^ = f{x) + X{t) — x is always bounded from both above and below by finite constants. Notice 

that = kg{0,X) > hr,{xinitiai) > 0, which imxlies that the user base can never fall below zero. We 

explain this as follows. If the user base trajectory ever falls below zero then the user base trajectory will have 
to cross Xiniuai and will have a negative slope at Xinuiai, which contradicts the fact that ^\x=xi„itiai ^ 0- 
We now show that the user base can never cross + A'”‘^)(7j + yve know that the initial 

user base is less than Let us assume that the user base trajectory assumes a value greater than 
Since the initial user base Xiniuai is less than and the user base trajectory is continuous, which implies 
that the user base will have to cross from below. This means that for the user base trajectory to cross 

approaching it from below the rate of change of user base > 0 will be positive. We can get 


the upper bound on ^\x=x‘^p = kg{Xsup,X) < - 


77+77 


^sup ^ This contradicts the 


fact that > 0. Hence, the user base trajectory always stays below Therefore, we can claim 

that the user base trajectory is always contained in 5" = [xi„itiai-,x‘^'^^]. So, if we add another constraint to 
the problem, which says that the user base is restricted to be in S, the optimizati on problem will still be 


Dmitruk and Kuz’kina 


equiva lent. We add such a constraint to bring the problem in the form required in 

|2nn,4 . 

4. Since Xinuiai is a fixed value, which implies that the set containing the initial value is compact. 

5. Let us define a mapping from time [0,oo) to the set which X{t) always lies in. The set U = [0,A®“p] 
always contains X(t). Hence, the mapping always takes the value U independent of time and is therefore, 
continuous. 

6. The fifth condition in iDmitruk and Kuz’kina] (|2005[ l is automatically satisfied since the sets to which 


the user base trajectory and the optimal poli cy belong are uniformly bound ed. 

7. The function /3{Xinitiai) = 0 (defined in IPmitruk and Kuz’kinal (1200511 1 for our case and hence, it is 
obviously continuous. 

8. Define a function $(t,x, A) = (c(A) — 6(x))e“'’L The functions b{.) and c(.) are continuous in x and A 
respectively. The function is continuous in t as well. The function —5(x)e“^‘ and c(A)e“'’‘ is continuous 
as well. Hence, the sum of these functions, i.e. $(t,x, A) is continuous as well. Since c(A) is convex and there 
is no other term in $(t,x,A) that depends on A, which means that the function $(t,x,A) is convex in A as 
well. 

9. Consider the negative part of the function (j){t,x,X) = (c(A) — 6(x))e“'’*, which is equal to —6(x)e“'’*. 

We need to show that negative part of (j){t,x,X) i.e. limj,/_,,^ jf, —b{x{t))e~'’*dt goes to zero. Observe that 
0 > jf, —b{x{t))e~^*dt > Hence, clearly the limit of the term goes to 

0. Note that this limit will be 0 for all user base trajectories. Also, note tha t here we are using the property 
that both the Lebesgue and Reimann integral take the same value lApostoll (|l974ll 

10. In addition we also know that there is a feasible trajectory i.e. which satisfies all the conditions above 
and gives a finite value of the objective. Consider the case when X(t) = 0 then we need to show that there 
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is a corresponding feasible user base trajectory, which starts at Xinuiai and satisfies ^ = h^{x). We need to 
show that there exists a user base trajectory over the time interval [0, oo) that satishes the above differential 
equation at all times. We will show that the maximal interval of existence of the above differential equation 
is [0,oo). We know that hn{x) is continuously differentiable, which also implies that it is Locally Lipschitz. 
Define a set Z = + 6), where 6 > 0. We know that the function is bounded on Z (because 

closure of Z is compact and hr,(x) is continuous on closure of Z). From iKartsatosI (120051 1 we know that 
maximal interval of existence is either [0,oo) or [0,w] and the user base trajectory at time w has to be 
on the boundary of Z. This means that the user base has to be either 0 or But both of these values 
are not possible. + l and 0 cannot be attained, the reasoning is same as in point 3 given above, where 
it was explained as to how every user base trajectory is bounded between [xinitial, x‘‘'^^]. This implies that 
the only other possibility for maximal interval of existence is [0,oo). This shows that there is a feasible 
user base trajectory that satisfies the differential equation at all times. Next, we want to show that the 
fif(b(x(t)) — c{X{t)))e~'’^dt computed using this user base trajectory and A(t) = 0 exists. The integrand 
(6(a;(t)) —c(A(t)))e“'’‘ < \b{x{t))e~'’^\ < and exists. Hence, the above integral exists 

and is finite. 

Given that all the assumptions required in iDmitnik and Kiiz’kinal (j2005[ l hold, we can use their main 
theorem, which proves the existence of an optimal policy and corresponding user base trajectory pair 
^xists. Lct thc corrcspondiug optimal value of the objective be given as U* {xinitial, v)- 
Note that \Tl*{xinitiai,v)\ is bounded by a constant, which can be shown using the fact that the user base 
trajectory is always bounded. Formally stated {H*{xinitiai,il) \ < /o°° b{x’’'^^)e~'’^dt = Similarly, we can 

get a lower bound U*{xinitial, v) > 

Suppose we take another value for the initial user base Xj, which is between [xinitial, Xu] then also the 
above proof of existence will continue to hold. This means that the objective’s optimal value is well defined 
and finite for all initial user base values between [xinitial, Xu]- We will only need to make some changes in 
the approach to satisfy condition 3, where we show that the user base trajectory is bounded. We showed 
in condition 3 that the user base is bounded above by Now since the initial user base can be more 

than the bound will change to Xn because if the initial user base is more than x”'^^ then the user base 
trajectory will always decrease. Hence, in this case the user base trajectory is in the interval [xinitiai,Xu]- 
In this way we can extend the domain of n*(a;,? 7 ) to [xinitiai,x„]. In fact observe that the domain can be 
extended to any interval [0,s], where s is a finite constant, because the derivation did not rely on the fixed 

values Xinitial, Xn- 

Note that we will in fact only take a fixed value of initial user base for the entire paper Xiniuai ■ The above 
definition of the value function on the entire interval is to consider the optimal policy at different user base 
levels that will be achieved by the user base trajectory from initial user base upto the steady state. 

Next, we want to show that the value function Il*{x^r/) is differentiable in the interior of [xinitial, Xu]- We 
can define another set Sin = [xinuiai — ^-,Xn + l\, where 0 < i < XiniUai- The user base trajectory starting at 
Xinitial is always in the interior of Sin- We know that = hn{x) + X < + A'”'^. The lower bound is given 

as hr,{x) + A > iXu- These upper and lower bounds are derived based on the following facts about the 
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model, the direct arrival rate is bounded above by and below by and the sponsored arrival rate 
is bounded above and below A'”'^ and 0, the user base is in set S and the stay time of the users is bounded 
above and below as well. While the bounds that we arrive at are weak and can be made tighter. As we will 
see that we do not need tight bounds for our purposes. 

Therefore, we can say that ^ always lies in the interior of — w,+ w]. 

The user base trajectory and the corresponding rate of change lies in the interior of the set 

T = X n;„. where x is the C artesian products of the sets. We now show that the assumptions in 
Benveniste and Scheinkman ( IQTol) are satisfied and hence, we can prove that the optimal value function is 


differentiable. 

1. The set T is a finite interval and hence, it is convex and has a non-empty interior. 

2. Substitute X{t) = — h^{x) in x, A) = (c(A) — 6(x))e“^‘ to get x, = {c{j^ — h^{x)) — b{x))e~'"^. 

We know that the h^{x) is concave. Hence, we can see that ^ — hr,{x) is jointly convex in ^ and x. We know 
that c(A) is increasing and convex. But note that the domain of c(.) as assumed in our model is only the 
non-negative real axis. We want to show that c(^ — h^{x)) But in order to be able to use composition rules, 
which use sufficient conditions to show the convexity of c(^ — hr,{x)) we require that the c is defined over 
the entire real line. We can extend the definition of c(.) and assume that it is also defined on the negative 
half of the real line. Even under the extended definition c(.) is strictly convex and increasing. Note that 
having this extension does not affect the policy design problem because we have a hard constraint that A is 
non-negative. So, now we can state that c(^ — h^{x)) is convex. Hence, the function (j){t,x, is convex in 

3. The optimal value function Il*{x,r]) is well defined and bounded for all values in the interval Si^. For 
every value in the interior of the interval Sin the value function is well defined in its neighborhood. 

4. The optimal user base trajectory and the corresponding ^ always lie in the interior of T. Since the 
trajectories are always in the interior we can deduce that there is an e ball around these trajectories, which 
is contained in T. 

Note that the above four conditions are sufficient to show that n*(a;,77) is differentiable. We also get that 
H* {x, rf) is continuously differentiable. Now we want to obtain the optimal policy as a function of user base 
(*(.). We know that the C*(^) corresponds to argmaxo<A<A'»’' , (i(x)) - c(A) + iM(/(x) + A - 
(due to Bellman’s principle of optimality). This maximization is a simple one variable constrained optimiza¬ 
tion problem. Note that there are two constraints A < A®"^ and —A < 0, we introduce Lagrange multipliers 
corresponding to these constraints mi and m 2 . We now state the KKT conditions for the above problem. 
The above problem is concave and the Slater’s conditions (there is a non empty i nterior for the constraint 
setl a re satisfied, hence, KKT conditions given below are necessary and sufficient (|Bovd and Vandenberghe 


200J. 


mi > 0, m 2 > 0] 

A< A'”'^ -A<0 


mi(A-A"“P) =0, 
ox 


m2{X) =0 


-|- mi — m 2 = 0 
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We look at the solution to the above problem in cases. 

(a) Consider the case when > c In this case the first possibility is that A = A®"^ is a 

solution. If that is the case then m 2 = 0 and mi = > 0. Hence, in this case A^^^, m 2 = 0 

and mi = c correspond to the unique solution of the optimization problem. 

(b) Consider the case when g^’^^ < c'(0). In this case the first possibility is that A = A®“^ is a 

solution. If that is the case then mi = 0 and m 2 = c'(0) — > 0. Hence, in this case A = 0, mi = 0 and 

m 2 = c'(0) — correspond to the unique solution of the optimization problem. 

(c) Consider the case when c'(0) < < c'(A'”‘^). From complementary slackness we can see that 

mi and m 2 will be zero. In this case c (A) = and in this case we can use the property that c (A) is 

strictly increasing (due to strict convexity of c(.)) to conclude that the solution to A in this case will be 
unique and we can concisely write it as A = c'~^ ). Since c'(0) < gf^^ < c'(A““’’) we can see that 

A = is primal feasible. Hence, in this case A = mi = 0 and m 2 = 0 correspond to 

the unique solution of the optimization problem. 

From the above discussion we can conclude that the optimizer as a function of the user base exists and 
is unique. In proving the uniqueness above we did not use the Assumption 1 part 1. In fact we require 
Assumption 1 part 1 for the uniqueness of the optimal policy as a function of time. 

So, up until now we have shown that the optimal policy as a function of user base exists and is unique, 
which proves Proposition 1. We have also proven part of Proposition 2, since we know that the optimal 
policy and the corresponding user base trajectory exists. Next, we aim to show that the optimal policy as 
a function of time is unique. The optimal user base trajectory satisfies the following differential equation, 
^ = f{x) + X{t) — We already know that the optimal policy as a function of user base is unique 

and is given as C*(-)- If II^® aser base at time t is x{t) then the optimal policy at time t is (*{x{t)) (This is 
true because of the following two reasons, i) optimal policy as a function of time needs to satisfy 

Bellman’s principle of optimality, ii) optimization problem has an infinite horizon). Hence, we can formulate 
the differential equation that the optimal user base trajectory needs to satisfy is given as ^ = f{x) + (*{x) — 
-, ^ , ' X. If we can show that the solution to this differential equation is unique then we are set because then 

77+773(2;) ^ ^ 

(*{x{t)) will be uniquely determined. We can write the differential equation as ^ = h^{x) + C*{x). The first 
term hr,{x) is continuously differentiable on {xinitiai,Xu) and if the second term is continuously differentiable 
as well then h^(x) + C*(a;) will be continuously differentiable on {xinitial, x^)- We know that an optimal user 
base trajectory exists defined over the time interval [0,oo). Thus, we can use standard r esults on uniqueness 
in ODEs, which extend Picard’s results to global uniqueness ICoddington and LevinsonI (|l955r) . Next, we still 
need to ensure that the second term is differentiable. 

c (0) < g^’^^ < c'(A®“P) holds for all user base levels in the interval [xinitial, Xu]- We know that the user 

base trajectory starting at Xi„itiai < always lies in the interval [xinuiaux^,] and thus, we can conclude 

the sponsored arrivals at all times will be given by c'“^(^2-^^). This condition ensures that the optimal 
policy is always in the interior of [0, A®“^]. If is continuously differentiable then we can conclude that 

{x,r,) ^ continuously differentiable. 
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Besides the above cases there can be scenarios when the above conditions do not hold. In those cases it may 
be difficult to show the uniqueness of the optimal policy as a function of time. In the proofs of theorems that 
will follow we will discuss the impact of the case when there is non-uniqueness. Essentially, we will argue 
that all the results that we will present (Theorems 4-6) do not rely on uniqueness of the optimal policy. The 
results presented in those theorems will hold for all the optimal policies, if there are multiple of them. 
Appendix B: Proof of Existence and Uniqueness of the Steady state 


In Theorem 1 we state that for every online firm in our model there is a steady state, which is unique. 
We first show the existence part of the proof. In steady state we know that the user base is fixed, which 
means ^ = 0. Hence, A = —h^{x) is the first condition for the steady state. In Proposition 2 we used the 

following fact that c (0) < < c (A®"^) for all x G [xinitial,x^]- If this is true then the optimal policy 

is always given as Hence, we can write the rate of change of user base as ^ = h^[x) + 

[Xinitiai ) + c'~^jg greater than zero. Also, note that hr,{Xn) + c'-^ C^ < 

hriixu) + A®“^ < 0. Therefore, either h^{xu) + + A®“^ = 0, which proves that x^, is 

steady state or hr,{Xn) + ^ h^{Xn) + A®“’’ < 0. If the latter is true then from continuity of 

hr, {x initial) + (^initial :v) .^g know that there exists a steady state. 

Suppose that there are two steady states and x^ . The corresponding values of the policies are as follows 
Aj = —hr,{x\) > 0 and A^ = —hr,{x^) > 0. Without loss of generality consider x] < x^. Aj > 0 which gives 
hvi^l) < 0 s-iid similarly h{x^) < 0. Recall that hr,(xinitiai) > 0 and hr,{x) is concave, which we can use to 
deduce that hr,{x\) > which implies A^ < Af. We know that ^ (Aj) and = c (A^). 

Note that c is an increasing function and ^ decreasing function in x (due to concavity Lemma 

1). RHS of steady state equation corresponding to x^ ~ c (Af)) is higher than the RHS of steady 

state of while its LHS is lesser. Thus, the second equality cannot hold, which leads to a contradiction. 
Appendix C: Proof of Convergence to the Steady state 


In the previous section we proved that there exists a steady state. In this section we prove Theorem 2. We 
know that there exists a unique user base trajectory that satisfies the differential equation ^ = hr,{x) + 
^,_i(8n_^^) g^i- Xinitiai- Also, we know that hr,{x) + is positive in the interior of the 

interval [xinitial, Xs]. And if we choose a user base level sufficiently close e to Xs and it is given as Xs — e. Then 
hn{x) +c'“^(^2-^^) > e for all the values of a; G [xinitiai,Xs — e]. This means that ^ > e' up until x> Xa — e. 
This means that starting with Xiniuai the user base trajectory will reach the threshold in at most time, 
which is finite. In this way by repeating such an argument we can show that the user base trajectory is 
bound to converge. As soon as the x{t) reaches Xg the rate of change will be zero. The corresponding optimal 
policy will also converge as the user base trajectory converges. 

Appendix D: Proof of Monotonicity of the Optimal policy 


We state a lemma below, which states that the optimal value function is concave in x. We discuss this Lemma 
separately as we need it even at l ater stages in other proofs. However, the proof of the differentiability in 


Benveniste and Scheinkmanl (|l979r) also proves this lemma as well. 


Lemma 1. The optimal value function n*(a;,? 7 ) defined on the interval [xinitiai,x„] is concave in x. 
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We know that b{x) — c(^ — hr,{x)) is jointly concave in {x, and {x, G T, where T is convex. This 
establishes the concavity of Il*{x,r]). 

We already know from the previous subsection that the optimal user base trajectory converges. Clearly 
since > 0 at all points on the path before the policy converges, which implies that 

the user base increases strictly before converging. The corresponding optimal policy as a function of time is 

= c'“^(- —“)■ Note that c'“^(.) is increasing and is a decreasing function in x 

because If*(a:, 77 ) is concave. These factors combined with the fact that x*. ^(t) is an increasing function 

of time lead us to conclude that the optimal policy will decrease with time. 

Appendix E: Proof of results on impact of network effects on optimal policy at the 
same size of the user base 


In this section where we compare the optimal policy as a function of user base Up until now we have 
shown that the optimal policy as a function of user base is well-defined for [xinuiai, ■ 

In Theorem 4 we show if the online firm is patient, i.e. a low p then the change in the optimal policy 
depends on its user base size. We are interested in analyzing the case when the solution A lies in the interior 
of the interval [0, A““^]. We are only interested to analyze the impact of network effects on this case because 
otherwise when the solution lies on the boundary then the change in the network effects will not change 
the solution. When the solution lies in the interior then the HJB equation can be written as follows. Let us 
denote = n'‘(x, 77 ) and c'-\X) = q{X) 


pU* (x, 77 ) = 6 (x) - c(g(n'* (x, 77 ))) -b n'* (x, 77 ) {hr, (x) -b ( 7 (n'* (x, 77 ))) 


dn*(x,77) ,, ,,, 5n'*(x,77) ,, , , , ,,, dn'*(x,77) 

p—= —^y^ + (h„(x) + g(n'‘(x,77)))- 


dp 


dp 


dp 


T-r/*/ X //T-r/*/ XX^n'*(x,77) .dhJx) 

n'*(x,77)q'(n'‘(x,77))—A^ + n'*(x,77)- 


dp 


dx 


We will rearrange the above equation to bring all the terms that do not contain on the LHS to 

obtain p ^^ Simplifying the LHS term 


dn*(x,?7) 

^ dp 


n'*(x,77) 


(77-b77g(x))' 


:9{x)^ 


( 1 ) 


Next, we will show that is positive. The intuition for this result is that when there is an increase 

in the network effects then the users stay longer at the same size of the user base, which causes the user base 
that is active at any time t to be higher. 

In order to figure out we will find a lower bound for it. Consider 77 and p + Ap. The optimal policy 

as a function of user base at 77 is given as (*{■). Construct another policy which is given as Cv+^v — ~ 

hn+Ar, + Q ■ Note that Ap can be chosen to be sufficiently small such that — hn+Ar] + Q is in the interior of 
the set [0, A““’’]. The motivation to construct this policy is as follows. Consider the rate of change of the user 
base when network effects is p and the policy is (*, ^ =(* + h^. Consider the rate of change of the user base 
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when network effects is rj + Ar] and the policy is Cn+Ar,, = Cr 7 +A 77 + ^ 77 +A 77 - We simplify the rate of change 

by substituting for Crt+Ar, to get =(* + h^- Now we see that the rate of change ^ 

and since both a;^(t) and start at the same point. Our other assumptions ensure that Q +/i,, satisfies 

conditions for the uniqueness of the solution to the differential equation. Hence, both Xrj{t) = Xr,+/\n(t) and 
we denote it for simplicity as x{t). Clearly, the benefits achieved along both the policies is the same. But 
the latter policy needs to give out fewer advertisements as hr,+Ar,{,x) > h^{x). Therefore, the cost is lower at 
all times, which leads to a positive change in the value function. Let us compute the difference in the costs 
at all times. We will use the notation Xr,{t) and Xn+Ar,{t) for the policies at time t for simplicity. c(A,,(t)) — 
c(A,,+A 77 (i)) = c{Xrj{t)) — c{Xrj{t) + hr,{x{t)) — A 77 (x(t))). Siiicc Ar] is small this difference can be simplified 

to c'(A„(t))(/i„+A^(x(t))-/i„(x(t))). Hence, we can write ^ c'(A„(t)) ^ c'(A„(t)) ■ 

We know that the second policy that we obtained is not necessarily optimal, but we can use the difference in 
the value function of the optimal policy at rj and the suboptimal policy at 77 + Arj. We know that there is no 
difference in the long-term benefits. Let us compute the difference between the long-term costs. C'(A,,(.)) — 
C{Xr,+Ar,{-)) = Jf‘{c{X^{t) - c{X^+^r,{t))e-P^dt. Using the ^ ^ 

we can get = /p°° = Jf’ c (A,,(f)) Since each of the terms in the 

integrand is strictly positive we can deduce that > 0. Hence, we get > 0- Next, 


dr] — dr] 

^g{x)x is negative. 


we will show that if p is sufficiently small then — C^*(x, rf) 

First note that n'*(x, 77 ) is positive. The intuition for this is as follows, consider two user base levels x and 
X -I- Ax, where the latter has a higher value. If the online firm starts at a higher user base level x -I- Ax then 
even if the online firm follows the same optimal policy as it did starting at x then also the user base trajectory 
starting at x -I- Ax will be higher. A higher value for user base at all times wil l lead to higher benefits and 


Benveniste and Scheinkman 


the co sts will be the same. This leads to an improvement in the n*(x,? 7 ). In 
(|1979ll the authors in fact gave a technique to simplify the n'*(x, 77 ). In our case if we use their technique 
we get n'*(x, 77 ) =c'(C*(x)). Although, the simplification does not give an explicit value, it still depends on 
the C*( 2 ^)) which should not matter for us because as we want to arrive at sufficient conditions to make 
p ^^ — n'*(x, 77 ) ^^ 2 g( 37 )x negative. We can get a lower bound on c'(C*(x)), which we know is 


Define C, — 


Note that the supremum involves three terms the user base 


level, network effects and the discount rate . Also, if gf^^ is continuous 77 , x, p then the supremum is finite 
and positive. This is because we know that gf'^^ is positive and a continuous function on a compact set is 


RudinI (|l964ll . Define the infimum of C = g{x)x. Now we are ready to arrive 


bounded 

at the bound for p. p"" = If p < p**” and if p < <5 then the term p ^^gf'^^ — H'*(x, 77 ) g( 37 )x is 

negative. We needed the additional condition in the form of p < <5 because we considered the only the p’s 
that satisfy this constraint while deriving C. Hence, we can define the constraint on p as p < p = min{p**’, J}. 
In fact we can also define a stricter threshold p<p\ where p* =min{p*,pj}. The expression for p\ is given 
in Appendix F. Note that p* is defined in such a way to have a common lower bound defined for p. 

The term in the RHS is 

an*(x,p)^ 


dp 


-(/i„(x)-f g(n'*(x,p))-f n'*(x,p)(7'(n'*(x,77) -c'(g'(n'*(x,77)))) 


( 2 ) 
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If hn{xiniuai) > then we have the condition that {hr,{x) + g(n'*(a;, 77 )) + n'*(x, 77 )g'(n'*(a:, 77 )) — 

c'(( 7 '(n'*(a;, 77 )))) is greater than zero in a neighborhood close to the initial user base. Hence, < 0 for 

all the user base levels close to Xinuiai- In fact we can define a threshold Xi, where = c'(( 7 '(A'’“^)). If 

X < Xi, then /i,,(x) < c'(g'(A'”‘^)). Therefore, we can state the same for which implies that the 

optimal policy increases when there is a decrease in the network effects at low user base levels. 

We know that n'*(x, 77 ) is bounded above and below, because we are considering the case when the optimal 
policy is in the interior. Thus, q{Il'*{x,r])) + n'*(x,77)77'(n'*(x,77) — c'{q'{Il'*{x,r]))) is bounded above and 
below. Also, note that h^{x) is a continuous function, which approaches —00 as x approaches 00. This means 
at large enough values of x h^{x) +g(n'*(x,77)) + n'*(x,77)g'(n'*(x,77)) — c'(g'(n'*(x,77))) is negative. Let us 
now derive the threshold on the value of x at which the expression above is negative. We know that (7(n'* (x, 77 )) 
corresponds to the optimal policy, which is bounded above by A'”'^. We know that n'*(x, 77 ) < c (A'”'^). We 
can arrive at the supremum of x.q'{x) — c'{q'{x)) in the interval < x < c (A""^) and let us denote it by 
sup_ \Ye know that f{x) is bounded above by and the lower bound of -, ^, ^ x is In this 

expression /i^(x) + q(Jl'*{x,r])) + n'*(x, 77 )g'(n'*(x,77)) — c'(g'(n'*(x, 77 ))) we will replace each positive term 
by its upper bound and the negative term by the lower bound and thus obtain the upper bound for the 
expression. Then, we will derive a sufficiently large value of x when the upper bound is negative. Based on 
the above substitutions we get x > X 2 = (/““p + + A ''“^)(?7 + then the expression is negative. We 

can also define Xm, Xm = (/““J’ + + rj^'^vgsup'^ _|_ -^^here fc > 0 . 

Hence, > q. This means that the online firm decreases the advertisements and referrals when there 

is a decline in the network effects. This proves part I and 2 of Theorem 4. 

Before we prove the remaining parts of Theorem 4 we want to make a remark about the previous proof. 
We showed sharp results about what happens when the user base is close to initial user base level and then 
at very high levels. First of all the bounds that have been arrived are not tight and can be potentially be 
made tighter. Secondly, the part of the result which discusses the comparison at high user base levels but 
does not discuss whether the user base level is higher than the steady state level or lesser. If it is higher 
then it will not be achieved along the optimal population trajectory. But if it is lesser, for e.g., consider the 
scenario when the following is true x.g'(x) < c'(g'(x)) Vx € [xinUiahX^] then at user base levels close to the 
steady state the switch (from increasing to decreasing advertisements) in the behavior of the online firm will 
be observed. 

If p is sufficiently high then just the opposite effects of the previous theorem are observed. We already 
know that the RHS in (2) changes sign when user base increases from low to high values. All we need 
to show that if p is sufficiently high then the sign of the LBS in (I) is always positive. We can arrive 
at the supremum of We arrive at the infimum of and denote it as C = 

We know that C is non-negative, we assume that it is strictly greater 
than zero. Hence, if p > c ^A'”'^)^ then the term p ^^ — n'*(x, 77 ) ‘^^^^'''^ is positive. We can define the 
constraint on p as p > p“ = maxjpj, c (A'”‘^)y} This proves part 3 and 4 of Theorem 4. 
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Appendix F: Proof of comparison in the steady state 

Consider the case when occurs in the interior of the interval [0,A®“^] then we know that the following 
condition has to hold c (A^) = objective is to simplify the expression in ■ Consider 

a function W{x,r]) = _ \Y(^x,r]) is concave for the reason reasons as in proof in Appendix A and 

n*(a;,r 7 ) is concave (see Lemma 1). Note that W{xa,'r]) =11*{Xs,r]). We define thi s function in order to be able 
to compute the in terms of the derivative of W{x, rf) using the result in 

iiordi 


Benveniste and Scheinkman 


Rockafellaii (2015). 


As = —h{xs) is in the interior so there is a small neighborhood around —h{xa), which is also in the interior 
of [0,A'’“^]. Let us consider another policy in which the online firm has —h{x) sponsored arrivals if 0 < 
—h{x) < A'*'“^. If —h{x) > A®“’’ then the online firm gives A®“’’ sponsored arrivals and when —h{x) < 0 then the 
online firm gives 0 sponsored arrivals. For this policy we can compute the value function in the neighborhood 
of —h{xs)j which is in the interior of [0, A®”^] to get jg gq^^al to W{x,r]). We also know 

that this policy may or may not be optimal, which means n*(a;, 77 ) > W(x^r]) in the neighborhood of —h{xs). 


Hence, from the theorem in 


Benveniste and Scheinkman (1979) Rockafellar ( 2015 ) we know _ 


• We also know that c (A^) = Therefore, substituting A^ = —h{xa) we have 

dW^Xs.r]) 


c {-h{xs)) = ■ 


dx 


c {-h{xa)) = 


h' {Xa) + C 


b' {Xa) 

_ dhr,{xa) 
dx 


p 

C ( 


The solution of the above equation gives the steady state user base level Xa ■ ■ We are interested in analyzing 
the steady state value when the network effects decrease. Basically as the network effects decrease, we know 
that the steady state still exists (because the result in Theorem 1 is true for all values of 77 < 77 ®“^). Next, we 
will derive an expression to compute the rate of change of Xa ■ In this proof we will make the dependence of 
Xa and As on 77 explicit for clarity and denote them as Xaih) and Xaih) respectively. Note that Xaijf) is a well 
defined function because we know that the steady state exists and is bounded for all the values 77 < 77 ®“^. In 
order to analyze the Xa{ii) as a function 77 , we first need to show that Xa{ii) is continuously differentiable in 
77 . Consider the steady state equation 

b'{xa{r])) 


dhn(xs{r])) 

dx 


= c {-h{xair]))) 


( 3 ) 


The smoothness properties of the functions in the LHS and RHS of the steady state equation (3) can be 
used to show that Xs ( 77 ) is continuously differentiable. Let us obtain the expression for ^ by differentiating 
both the LHS and the RHS in the above equation (3), where ^ is the partial derivative w.r.t x. 

b"{Xa{r])) dxa{ii) b'{xa{r])) 


p-h'^{xs{v)) dr] 
-c {-hr,{xa{r])){ 


,dh'^{xa{r])) dxairj) dh'^{xa{v)). 


{p-h'Jxa{ri))Y dxa{r]) dr] 


dr] 


dh^{Xa{r])) dxa 


dXair]) dr] 


dhr,{xa) 

dr] 
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In the above expression we can rearrange the terms to obtain ■ 

dxs{r]) b''{xs{r])) b'{xs{r])) 


dr] p-h'(Xs{r])) [p - h'(xsir]))y 


b'{xs{p)) 


-c {-h^{x,{r]))) 


dx^{r]) 

dh^{xs{T])) 


) + c {-hr,{xs{p))) 


dhrjjxsjp)) 

dXs{p) 


) = 


9v {p-K^Xs{p)Y) dp 

We know that \s{p) = -h^{Xs{p)). Hence, Substituting the 


dxs{r]) 


dr] 


dr] 


expression for in the expression for we can obtain an expression for that depends 

on Xs{p) and other known functions. Hence, the only unknown term is Xg{p). Our objective is to 
find the sign of Simplifications of the above expressions yield that the sign of i 


IS 


the tame as It ^ < 0 then 

b"(T (-n)) b'(xa( 77 )) / Q/t^(xg(T 7 )) 

V s \ /// drj p—h'^{xs{r])) ^ drj dxs{r]) dxs{r)) drj 

y dh^{xs{r])) dhjj(xs{r])) dh^{xs{ri)) dhjj(xs{r])) \ _ 

^ dn dxc(ri) dx,:,(u) dr) ' / 


■ II ' 

h" (t I'n'l'l b {xg{r,)) idh^{xg{r,)) ahr,{xg{v)) 9h^{x s(v)) dhr,{x g{ri)) —tiirh imnlir" *'"0'>0)).,. ^ 

0 [Xs[p)) -- p-h>(xgM)^ - ^ -aMilj-- ^implies < 


/■ u' ( I \\\ dhrjixsiv)) 
{p-h^{xs{v))) -- 

lows p< A'(!.(,)) + ( “■‘I'"”".'!!?.'!" - ’iol'”"'?;"'" ) 


. We can further rearrange the equation as fol- 


t. This condition in the pre- 

dhjjixsiv)) b (xsiv)) ^ 

^ I."(xs(r 7 )) 

vious equation is sufficient to deduce the sign of but the condition depends on Xs{p), which we do 

not know. Define the infimum of the term on the RHS of the equation, inf 3 ,g[o, 2 ,„] m(a;), where m{x) = 
aMD _ £^ai;gM) ). Define a threshold for p[ = inf,g[o,ht„] m{x) Hence, if p < p[ 

then < 0. The above result holds for the uniform threshold p< p\ see Appendix E. 

Similarly, we can define an upper threshold for p as well sup„,gjQ_„,^] m(a;) and pj* = sup„,g[Q_„,^] m(x). On 
the same lines as above it can be shown that if p > p“ then > 0. 


Next, we prove Theorem 6. We know that 

{xsij])) _ ^ ,dh'^{xs{p)) dh^[xs{r])) dh'^{xg{p)) dhr,{xg{p)) ^ 

-x.,[p)<[ ) 


siv) 


dp 


dxs{p) 


dxs{p) 




( 4 ) 


b'{xs{p)) 
is equivalent to ^^gf^ < 0. 

If we take the supremum of the term in the LHS of the inequality (4) and the infimum of the term 
in the RHS in (4), then it will be sufficient condition to ensure that ^^gf^ < 0. Define the supremum 


of the term in LHS as follows sup i ^ and the infimum of the term in the RHS as follows 0 = 


inf. 


aie[ 0 ,a:„],?)e[ 0 ,J 7 ‘''“P] 1 g^ g^. 


,dh^(x) dhp(x) Bh^jx) ahr,{x) 


Therefore, if sup,,g[o,,,„, < 0 then ^ < 


BXaiv) 


dr] 


0 . 




Similarly, we can define A = sup,j, 


Bh^jx) dhp(x) Bh^(x) ah^{x) 


xe[o,x,^],7ie[o,77®^p] \ ap dx ax ap 1 l y 

lines it can be shown that mfj,g[o, 3 ,„] > A then ^^gf^ > 0. This proves Theorem 6. 

Appendix G: More general benefit fnnctions 
G.l. Model and results overview 


On the same 


In the Sections 3-6 we assumed that the benefit function is concave. We now discuss the extension of our model 
to more general class of benefit functions. The benefit function b{x) considered is a continuously differentiable 
increasing function that is not necessarily concave upto a threshold tthreahoid and beyond the threshold the 
benefit function is concave. Formally stated b : [0,tthreshoid] —t R+ is convex and b : [tthreahoid, 00 ) —t R-i- is 
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concave. The benefit function is continuously differentiable and increasing on the entire domain [0,oo). An 
example of such a function is sigmoid function which is S-shaped. We will assume that the threshold tthreshou 
is not very high, i.e. tthreshoid < This assumption implies that the threshold is less than the steady 

state user base level. This is reasonable because closer to the steady state the market for the online hrm 
starts saturating and that is why the benefits will also saturate. 

For the above class of benefit functions we can extend the following results discussed in Sections 3-6. 
Considering more general benefit functions do not affect the proofs of Proposition 1, Proposition 2, Theorem 
1 and, Theorem 2 do not change much and these results continue to hold. In Theorem 3 we had shown that the 
optimal policy of advertisements and referrals was monotonically decreasing. This does not completely extend 
to this scenario. The reason for this is that the marginal benefit of adding another user when the optimal 
user base trajectory is in the regime ]f),tthreahoid\ can increase due to the convexity of benefit function in the 
regime. However, it can be shown that once the optimal user base trajectory enters the regime [tt/ireshoid, oo) 
the advertisements and referrals decrease with time, which is due to the concavity of the benefit function 
in this regime. The comparisons in Theorem 4, which show the impact of network effects on the optimal 
policy as a function of user base continue to hold. Also, the comparisons of the optimal policy in the steady 
state under a change in network effects shown in Theorem 5 and 6 continue to hold. Next we give a detailed 
discussion of results. 


G.2. Detailed discussion of results 

In the model presented in Sections 3-6 we had assumed that the benefit function is concave. Consider a 
more general benefit function defined as b : [0, tthreshoid] —>■ R-i-j where b{.) is a continuously differentiable and 
increasing function (not necessarily concave) on the domain [0, tthreshoid]- In the domain [threshold, oo) the 
function b: [tthreshoid, oo) —R+ is concave and increasing. On the entire domain [0,oo) the benefit function 
is continuously differentiable and increasing. Next, we discuss how do the results presented in Section 3-6 
change for more general benefit functions. In Proposition 1 we discussed the existence and uniqueness of 
optimal policy as a function of user base (C*. It can be shown that the optimal policy as a function of time 
exists using the same steps as in Appe ndix A. The fact that benefit fu nction is not concave does not impact 
the conditions required for existence in iDmitruk and Kuz’kinal (120051) . If this is true then the optimal value 


function n*(x,77) is well defined. If the value function IV*{x^r]) is well-defined then we can follow exactly the 
same steps as given in Appendix A to show that the optimal policy is unique. Since the optimal policy as a 
function of user base C* is unique we will use this to solve for the optimal policy as a function of time. The 
optimal policy as a function of time will be equal to C*(t)), where (t) is the optimal user base 

trajectory. Next, we would need to show that the optimal user base trajectory as a function of time exists 
and is unique. Due to Assumption 2, we can show that the solution to the differential equation of the user 
base exists and is unique. This will establish the uniqueness of the optimal policy as a function of time. 

We next discuss how do the theorems change in this setting. 

In Theorem 1 we showed the existence and uniqueness of the steady state. In order to prove the existence 
we had exploited the property that the user base trajectory is bounded and increasing. In this case as well the 
user base trajectory is bounded above and increasing for exactly the same reasons as mentioned in Appendix 
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A, therefore, steady state will exist. In order to show uniqueness we will exploit the concavity of the n*(a;,? 7 ) 
in the neighborhood of the steady state (justified in next paragraph) and use the same steps as before. The 
same idea can also be used to show the convergence as in Theorem 2 (increasing and bounded function will 
converge). 

In Theorem 3 we had shown that the optimal policy was a decreasing function of time. For this we relied on 
showing that n*(x, 77 ) is concave in x. In this case however, it is not necessarily true that 11* (a;, 77 ) is concave 
on the entire domain [0,oo). This is because the benefit function is not concave in the domain [0,tthre3hoid\- 
However, since the benefit function is concave in the regime [tthreahoid, 00 ) we can show that H* (x, rj) is concave 
in the regime [tthrs 3 hoid,oo). In order to show this we can write the net benefit function b{x) — c(^ — h^(x)), 
which is concave in [tthreahoid^oo). Note that once the user base trajectory crosses the tthreahoid it will stay 
above Uhreahoid, this is because h^itthreahoid) > 0 (follows from the assumption that tthreahoid < Since 

the user base trajectory always stays in the regime in which the net benefit function is conca v e and since the 
regime itself is a convex set, we can use the same result as in iBenveniste and ScheinkmanI ([l979|) to show 
that the ir'{x,r]) is concave in [Uhreshoid, 00 ). Also, note that we can show that the user base trajectory is 
increasing, this is because the rate of change of user base h^{x) +C*(x) > 0 at all points before the steady 
state. The steady state value has to be greater than > Uhreshoid, which means that the user base 

trajectory will at some point exceed tthreahoid- Since n*(x,? 7 ) is concave in [tthreahoid, 00 ) and this combined 
with the fact that the user base trajectory is increasing and will cross the threshold tthreahoid we can conclude 
that the policy will decrease with time after some time (when user base exceeds tthreahoid)- 

In Theorem 4 we show the impact of the level of foresight and the size of the user base on the online firm’s 
decision when it sees a decline in the network effects. In Appendix E, where we show the proof for Theorem 
4 we only rely on the fact that the benefit function is increasing. So Theorem 4 continue to hold in this 
setting. 

In Theorems 5 and 6 we analyze the impact of change in network effects on the optimal policy as a function 
of time. In the proof shown in Appendix F we developed the steady state equation. In developing the steady 
state equation we used the fact the value function is concave in the neighborhood around the steady 

state value. Note that the steady state value is more than tthreahoid- Since n*(x,? 7 ) is concave in the region 
[tthreshoid,oo), this means that Il*(x,ri) is concave in a neighborhood around steady state. Therefore, we can 
develop the same steady state equation and the results analogous to Theorem 6 and 7 can be arrived at. 
Appendix H: Users take decisions to join the online firm 
H.l. Model and results overview 

In this section we extend our model to allow for users decisions. We first describe the additions in the model 
discussed in Section 3. In Section 3 we had assumed that the direct arrival rate was f{x). We need to modify 
the direct arrivals to take users’ decisions into account. Suppose fix) users hear about the online firm (for 
e.g. word of mouth) per unit time and have to decide whether to enter the online firm or not 0 Each user 


fix) users can also comprise of the users who already know about the online firm and need to decide whether to 
visit the online firm or not 
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is rational and has a preference a for the online firm drawn from a uniform distribution [amin, o^max] li!l . The 
user’s benefit from joining the online firm increases with the size of the user base. The user’s benefit also 
increases with the preference of the user for the online firm and it is given as ax. Each user bears a fixed 
cost for joining the online firm given as center- The utility of the user from entering the online firm is given 
by ax — Center Slid is zero otherwise. At every time instance fix) users need to decide whether to visit the 
online firm and suppose a fraction w{x) of these users decide to join the online firm per unit time, then the 
direct arrival rate is given by fix)wix). 

In Section 3 we had the sponsored arrival rate at time t given as A(t), which was a result of the adver¬ 
tisements and referrals given by the online firm. Suppose the online firm can control the number of users 
who come across the advertisements X{t) users. These users need to decide whether to enter the online 
firm or not. Hence, if a fraction wix) of users decide to enter the online firm the rate of the sponsored 
arrivals is Xit).wix). Therefore, the rate at which the user base of the online firm changes is given as 
^ = f{x)w{x) -b Xit)w{x) — X. The benefits and costs for the online firm remain the same as in the 

Section 3. Hence, the long-term discounted average profit of the online firm is given as B{x) — C{X) (as 
defined in Section 3). 

Before we discuss the equilibrium of the above model we make a remark about the model discussed above. 
In the model discussed above we can also include the payments made to the online firm directly in the 
form of subscription fees. We can do so by adding a constant fee to the cost of entry for the users. We also 
need to modify the benefit function of the online firm as follows. The benefit function will be sum of bix), 
which represents the benefits generated by users clicking on advertisements, and a term that is equal to 
the subscription fees times the rate of arrivals (sum of the direct and the sponsored). We do not add the 
subscription costs to avoid a complicated description. 

H.1.1. Equilibrium analysis: In the model discussed above both the users and the online firm make 
decisions and want to maximize their own utilities. We want to compute a policy for the online firm and the 
users in which both the online firm and the users cannot deviate unilaterally to increase their utilities. 

Let us consider a user with a preference level a for the online firm. Suppose the user comes across the 
online firm directly or through an advertisement then he needs to decide whether to enter the online firm or 
not. The user has two actions {enter, not enter}. We assume that the user knows the size of the user base 
p. If the utility from entering is greater than zero a.x — Center > 0 then the unique best response of the user 
is to enter the online firm and not enter other wise. Note that the user is assumed to be myopic and hence, 
the best response only depends on the size of the user base p and not on the future decisions to be made 
by the online firm. Hence, the best response of the user as discussed above gives the strategy of the user in 
equilibrium. Let the set of a’s which satisfy a.x — Center > 0 be given as [a(a;), amax]- Hence, the fraction of 
users who enter the online firm is given as w’"’(x) = , 

The online firm has a belief about each user’s actions. The online firm also knows the preference distribution 
of the users. We assume that the online firm holds the same belief for users of the same type a. Hence, the 

Uniform distribution is not a restrictive assumption and the results discussed in this subsection extend to more 
general distributions as well 
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online firm will have a belief about the fraction of the users that will enter, denoted as w{x). The online 
firm’s profit maximization problem for the belief function w{x) is stated as follows. 


^{xinitiauv) = max B{x)-C{X) 

0<A(t)<A'>’‘P.Vt>0, 

dx 1 

subject to — = fix)w{x) + X{t)w{x) - — - 

at (r] + r]g{x)) 


The optimal solution of the above problem is denoted as and the corresponding user 

base trajectory is given as (t, ih(.)). Note that the optimal policy is a function of the belief w{.). In 

equilibrium the fraction of users joining the online firm is given as w’^’^{x). The online hrm computes the 
optimal policy in equilibrium with the belief w{x) = w’^'^{x). We already know that the users will not want to 
deviate from their decisions that result in (x) fraction of users entering. The online firm will not want to 
deviate either as it maximizes its profit. The equilibrium policy as a function of time is denoted as ^ 

and the corresponding user base dynamic is denoted as ^ 

Next, we can also formulate the HJB equation for the above problem in the same manner as in Section 3. 
Let us denote the optimal policy as a function of user base as (•)■ 

We can show that the results presented in Section 4-6 continue to hold under certain assumptions. We 
now give the intuition of how to reduce the above analysis for the case above to the analysis in Section 
4-6. Let us consider the differential equation in Section 3 and the one developed above. The first thing to 
note is that the function f{x) represented the direct arrivals in Section 3. Here, we have a new function 
which is f{x)w{x). Earlier we did not have a term multiplied with A(<), while here we have a term of w{x) 
multiplied with X(t). This is an important difference in comparison to the earlier sections. We can define a 
function h^{x) = f{x) — x. If we replace hr,{x) in Sections 4-6 with and also add some more 

regularity conditions then the main results presented continue to hold. Next, we give a detailed discussion 
about the results. 


H.2. Detailed discussion of results 

As pointed out earlier in this case there are two differences in this extension. First, the rate of direct arrivals 
changes from f(x) to f(x)w(x). w{x) is increasing and bounded above, which is implied from its definition. 
We can assume in addition that w{x) is continuously differentiable and also assume that it is bounded below 
by a positive constant. We assume that amin = 0 and a^ax = 1 without loss of generality. If the distribution 
of the users is uniform then a user enters if a > . Hence, if a; > center then only will a user enter. The 

fraction of users entering if a; > Center is w(x) = (I — Now if Xinuiai > Center(l + k) then the function 

w{x) is continuously differentiable and is lower bounded by Second, the rate of sponsored arrivals 
changes from A(t) to X{t)f{x) 

In Proposition I and 2 we showed the existence and uniqueness of the optimal policy. First we showed 
that the optimal policy as a function of time exists. For that we used the sufficient conditions shown in 


Dmitruk and Kuz’kinal (|2005ll . Even under the two key changes described above the sufficient conditions in 
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Dmitruk and Kuz’kinal (|2005ri continue to hold. Therefore, the optimal policy as a function of time exists and 


n*(a;, 77 ) is well defined. We use this to show the existence and uniqueness of the optimal policy as a function 
of user base (*{■)■ We need to modify the part 1 of Assumption 1 as follows c (0) < w{x) < c (A““^). 

This modification ensures that the optimal policy is in the interior of If*(a:, 77 ) has to satisfy the 

HJB equation and the optimal policy as a function of user base will be the maximizer of the RHS of the 
HJB equation given as c'~^ w(x)). This proves Proposition 1. 

Since we already showed the existence of the optimal policy as a function of time, we need to show 
uniqueness next. Basically we need to show that ^ = f{x)w{x) + c'~^ w[x))w{x) — x for initial 

value Xinitiai has a unique solution. We need to modify assumption 2 as follows. Define h^{x) = f{x)w{x) — 
the assume that h^ix) is continuously differentiable. c'~^( w{x))w{x) can be argued to 

be continuously differentiable based on w{x) and continuous diffe rentiability. Thus we can use the 

results from theory of ODEs (in the same way as we did in Appendix A) ICoddington and LevinsonI ([l955|) 
to show the uniqueness. 

In Theorem 1 we showed the existence of the steady state. Note that in this case as well we can show 
that the user base trajectory is bounded above. This is because the f{x)w{x) is bounded above by and 
c'-i( ‘^^f’'^"' w{x))w{x) is bounded above by A®"^. Therefore, the rate of change of user base is bounded by 


rx. Every trajectory that has a rate of change given by + A®"^ — ; 


tX is bounded. 


dx 

/ sup I \sup 

/\ -7—r tiy . j—; VV.X y ui.ct|v./v^uv. 7 x y uaxcau j.xcxo ct j. cxuv./ wx v v.^xx i_/ y / I ; 7—r.. 

77 + 775 ( 0 :) ^ J ^ o o J J riJ^Tjg(^x) 

Recall that we had the assumption that Xi^itiai < which results in h^XinUiai) > 0. This assumption was 

used in proving that the steady state exists. We can modify this assumption to Xinuiai < i+- Under this 

modified assumption the rate of change of user base hg{xinitiai) > 0. In order to show that the steady state 
exists we need to show that f{x)w{x)+c'~^ w{x))w{x) — x = 0 has a solution. We already know 

that f{x)w{x) + c'-'^{ ‘^^fj’^'> w{x))w{x) - j^::^x>0 when x = x^uai- f{x)w{x) + c'~^( w{x))w{x) - 
7 X at x = Xu has to be negative. Because f{x)w{x) + c'~^ ( 'w{x))w{x) — -, ^ x is continuous it 


- 1 / \ <JU CtU 4Xj tXjq, XlCArO LiV-7 V V.... V.y V^Ct U.C3\-- f \ ! \JU \ ! U- \ , UU \ <JU ! I iXJ \ .JU ! - . / \ ‘ 

77+775(0;) “ C’ J \ \ / vdo:A//\/ 77+775(0:) 

will have a root in the interval [xiniuai , Xu] ■ This proves the existence of the steady state. 

In Theorem 2 we showed the convergence of the policy to the steady state. We can replicate the same 
steps as given in Appendix C and prove convergence 

In Theorem 3 we showed that the optimal policy will be decreasing as a function of time. Here, it is not 
necessary that the optimal policy will decrease with time. This is because the optimal policy as a function 
of user base is proportional to and does not necessarily decrease with user base. 

We will show that jg concave, which will imply that is decreasing. But w{x) is increasing. 

Therefore, if the percentage decrease in jg greater than the percentage increase in w{x) then the 

optimal policy decreases otherwise it increases. 

Next, we discuss the impact on Theorem 4. We can write the HJB equation for this case as follows. 


TT*7 \ ui \ , ^^n*(x,77) an*(x,7?) +n*(x,77) 

pH (x, 77 ) = h{x) - c{q{ --- w(x))) H --- w{x)[f[x) + q{ --- u;(x)) - 


I 


dx '' ' dx \ \ j • \ n w{x)(f] + r]g(x)) 

As pointed out above that the optimal policy is proportional to y{x,r]) = ^2-^^'u;(x). We take the 
derivative of the above HJB equation w.r.t 77 to obtain the following expression. 
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^ dU {x,r]) ^ + g(y(x))- rTT^. - T^^:) + y{x)q 


drj 
+y{x)- 


dr] drj 


w{x){fi + yg{x))' 


drj 


g{x)3 


w{x){fj + gg{x)Y 
Rearranging the above equation we get the following 

g{x)x dy{x) 


aiR^_ 

dg w(x){fi+ gg{x))^ 

+y{x)q'{y{x))) 


dg 


i-c'iq'iyix))) + fix) + qiyix)) - 


1 


wix)ig + ggix))‘ 


Now we can carry out an analysis on the same lines as shown in Appendix E. Basically, a sufficiently low 
and high discount rate p will lead to a negative or a positive LHS. The sign of the term (—c'(g'(t/(x))) + 
f{x) + qiyix)) — + yix)q (y(a:))) will depend on the size of the user base. This way we can 

prove Theorem 4. 

Next, we discuss how Theorems 5 and 6 change. For Theorems 5 and 6 we had arrived at the steady state 
equations using the concavity of the value function. In this case as well we can show that the value function 
is concave in a manner similar to Lemma 1. We need to assume that bix) — — fix) + 

is jointly co ncave in ix,^) where x is r estric ted to be in [xinitial, Xy]. Using the sufficient conditions for 
concavity in iBenveniste and ScheinkmanI (|l979l) we thus have n*(a:, g) to be concave. We can therefore derive 


the steady state equation for this case as follows. 

wixs) p 

In the above h^ixs) = fix) — The above equation can be analyzed in the similar manner as 

in Appendix F. The analysis will show that the heterogeneity in the benefit function determines the impact 
of network effects. 

Appendix I: Stochastic arrival/exits 

In this section we allow for uncertainty in the arrivals and exits of users. This modifies the user user base 
differential equation to the following stochastic differential equation, dXt = (0 + A(t) — XdX)dt + aXdWt- 
iXd = ■^)- The HJB equation corresponding to this case is 

pli-ix) = max(r^ ~ (T — x)'^ — cX^ + Il'ix)i0 + X — X^x) + in”(x)CT^) 

A ^ 

In the above setting we allow for negative referrals for analytical tractability, but we do not allow that the 
user base becomes negative. Hence, the optimal A should ensure that the user base is always positive. Solving 
the optimal A we get A = . Let’s assume that n(x) = A'x^ + B'x + C' and substitute A = 

in the above HJB equation. Equating coefficients on both the sides we get A' = ^ +^(c) 

/ / ^ c 

and B' = 2 —Note that A — Ad < 0 and 9 + -^ > 0, these conditions together with the fy t that 
{p+^d—A~'l 

the initial user base a;(0) > 0 ensure that the user base dynamic stays above zero Isee IZhad (|2009ll l. The 


e+4 


expected user base at a time t is given as E[xit)] = 

-'d 

number of referrals are given as if[A(t)] = ^ _ 


The expected 
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If a;(0) < then the expected user base E[x{t)\ dynamic increases with time and converges to attain 


e+4 


^. The corresponding trajectory of expected number of referrals will decrease with time, this is because 
the referrals decrease with increase in user base 

We know that A = — X + -f-. We know that B = 2 ^ / and from the expression observe that if A 

increases B increases as well. 


We next analyze the impact of change in network effects on the optimal policy. We can compute = 


^^d-‘rP—<r 


= ). Observe that € [0,1]. We can also obtain the expression for — 


dB 
dXd 


Ad 

1 8 a' 
aXd 


c.(l —■ , - 

•\/(2Ad+p-o-2)2+4_ ' 

X6 + T{—1 + Ix^). Observe that the first term (p + Ad)0|^ > 0 and the second term r(—1 + |^) < 0. If T is 
sufficiently high then <0. If T is sufficiently low then < 0. Hence, we can see that if F is sufficiently 
low then both A and B increase with an increase in A^. Therefore, for a sufficiently low T the optimal 
policy as a function of the size of the user base given as + -l^ is an increasing function of A^. If T is 
sufficiently large then for sufficiently small size of the user base the optimal policy —X + ^ decreases, while 


for sufficiently large sizes of the user base the optimal policy — X- 


increases. 


We also want to compare the optimal policy in the steady state. Let us denote the expected population 
level in the steady state as X^- Therefore the optimal policy in the steady state is given as ^Xg + We 
now see how —Xg + -f- changes as A^ increases. = ^-Xg + A! ^-Xg. We know that 

I^Xs > 0, > 0 (because < 0 and < 0) and > 0. Therefore, increasing Ad will increase the 

advertisements and referrals in steady state. 

We prove Theorem 7 and 8 next. Next, we show that as a increases then A^ decreases. 


dA t ^ , (2Ad + p —cr^) 

^(2Ad + p-aT+4(i) 


/ / 

Observe that < 0. We know that B increases with an increase in A . Therefore, < 0. Therefore, 

dcr ’ da ’ 

with increase in a the will be a decreasing function of a. 

Q I B 

For Theorem 8 we need to compare the steady state user base level, which is ——This expression is 
positive and decreases with cr as H and B decrease with a. On the same lines we can compare the value 
functions for the result on profits. We obtain C' in the same way as A' ,B' and it is given a.s C' = B' 9 + 

C' decreases with a as B' decreases with a. This combined with the definition of the value function gives 
the result. 

Appendix J: Concavity of the benefit function 

In this section we will show that if the users arrive in the decreasing order of their preference then the benefit 
generated for the online firm is a concave function that depends on the size of the user base. 

Let the potential user base be a set S = [0,oo), where each member of the set corresponds to a potential 
user. Also, assume that length of any interval of S will denote the size of the user base that lies in that 
interval. Let s be a potential user with a preference for the online firm pr(s), where we assume without 
loss of generality that pr{s) is a decreasing function in s. We also assume that the benefit that user s will 
generate is proportional to pr{s). Let us construct another preference function that samples pr[s) and looks 
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like a staircase, which is defined as pr(s). Divide the domain into intervals of length As and pr(s) =pr(si), 
where S; corresponds to the left end point of the interval to which s belongs to. In the analysis that follows 
we will assume that preference of the users follows pr{s). We will show that our result will hold for every 
As. We know that pr{s) = limAs^oP*^(s), which will extend our result to pr{s). 

Consider the interval [0,As]. We know that the user arrivals happen in the order of the preference. All 
the users in the interval [0,As] have the same preference ^^(O) and p'r(O) is the highest preference. It is 
clear that initially users from interval [0, As] will only enter the online firm. Let the population of the users 
with preference pr{0) at time t be given as pi{t), where we use subscript 1 with population to denote that 
these users correspond to the first interval [0, As]. The rate at which the population will change is given as 
^ = /(xi) + X{t) — as long as the population is less than As. The benefit generated by these 

users is given as b{p) = pr(s)ds, where p is the size of the user base that has entered and is less than 
As. If As is sufficiently small, then the population trajectory of Xi(t) will achieve As level in a certain 
finite time. When Xi(t) achieves As then the users will still leave the user base at the rate As. 

The total direct arrival rate is /(As) and a fraction of direct arrivals will nullify the exits of the users 
ij+yg(As) remaining direct arrivals will bring in the users from the interval [As,2As]. Note that 

the users from the next interval only begin entering after the users from the previous interval have already 
entered. Also, the size of the user base from the interval [0,As] remains fixed to As. The users from the 
interval [As, 2As] will enter at the rate ^ = f{x 2 + As) + X{t) — (As + X 2 ) and ^ = 0. The 

total rate of change of population ^ = /(xj + As) + X(t) - (As + Xj) = /(x) + X{t) - (x), 

where p is the total size of the user base. The benefit that is generated is given as b{x) = fjpr(s)ds. This 
argument can be generalized to all the intervals. Note that b(x) is a concave function because pr(s) is a 
decreasing function. We also know that the above argument holds for any As. Therefore, we can say that 
b(x) =limAs^o Jq px{s)ds = \imj\s^oP'r{s)ds = pr(s)ds. The second equality in the above follows from 
monotone convergence theorem. Therefore, b(x) depends only on x and is concave. 
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